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NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
HELICOBACTER PYLORI AND VACCINE COMPOSITIONS THEREOF 

Background of the Invention 
5 Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that 

was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and 
B. Marshall, (1983) Lancet U 1273-1275; and Marshall et al., (1984) Microbios Lett 25: 
83-88). H pylori has been strongly linked to chronic gastritis and duodenal ulcer 
disease. (Rathbone et. al., (1986) Gut 27: 635-641). Moreover, evidence is 
10 accumulating for an etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer 
-diseaserarid-gastrie-adenGcarcirioma. -(Blaser Mt J.r(i 993)"^^ 

260). Transmission of the bacteria occurs via the oral route, and the risk of infection 
increases with age. (Taylor, D,N. and M. J. Blaser, (1991) Epidemiol, Rev 13: 42-50). 
K pylori colonizes the human gastric mucosa, establishing an infection that usually 

1 5 persists for decades. Infection by H pylori is prevalent worldwide. Developed 
countries have infection rates over 50% of the adult population, while developing 
countries have infection rates reaching 90% of the adults over the age of 20. (Hopkins 
R. J. and J. G. Morris (1994) Am, J. Med 97: 265-277). 

The bacterial factors necessary for colonization of the gastric environment, and 

20 for virulence of this pathogen, are poorly understood. Examples of the putative 
virulence factors include the following: urease, an enzyme that may play a role in 
neutralizing gastric acid pH (Eaton et al., (1991) Infect. Immunol, 59: 2470-2475; 
Ferrero, R.L. and A. Lee ( 1 99 1 ) M*cro6. Ecol. Hlth, Dis, 4; 121-134; Labigne et al., 
(1991) y. Bacterioi 173: 1920-1931); the bacterial flagellar proteins responsible for 

25 motility across the mucous layer. (Hazell et al., (1986) J, Inf, Dis, 153 : 658-663; Leying 
et al., (1992) Moi Microbiol, 6: 2863-2874; and Haas et al., (1993) Mol Microbiol, 8: 
753-760); Vac A, a bacterial toxin that induces the formation of intracellular vacuoles in 
epithelial cells (Schmitt, W. and R. Haas, (1994) Molecular MicrobioL 12(2) : 307-319); 
and several gastric tissue-specific adhesins. (Boren et al., (1993) Science 262 : 1892- 

30 1895; Evans et al., (1993) J. Bacterioi 175: 674-683; and Falk et al., (1993) Proc, Natl, 
Acad, Sci, USA 90: 2035-203). 

Numerous therapeutic agents are currently available that eradicate H. pylori 
infections in vitro. (Huesca et. al., (1993) Zbi Bakt, 280: 244-252; Hopkins, R. J. and J. 
G. Morris, supra). However, many of these treatments are suboptimally effective in vivo 

35 because of bacterial resistance, altered drug distribution, patient non-compliance or poor 
drug availabilty. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics 
combined with bismuth are part of the standard regime used to treat H pylori infection. 
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(Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics \5 Supp. 
B: 37-48). Recently, combinations of a proton pump inhibitors and a single antibiotic 
have been shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. 
Dominguez-Munoz supra). However, methods employing antibiotic agents can have the 
5 problem of the emergence of bacterial strains which are resistant to these agents. 

(Hopkins, R. J. and J. G. Morris, supra). These limitations demonstrate that new more 
effective methods are needed to combat H. pylori infections in vivo. In particular, the 
design of new vaccines that may prevent infection by this bacterium is highly desirable. 

10 .Summary nf the Invention 

This invention relates to novel genes, e.g., genes encoding polypeptides such as 
bacterial surface proteins, from the organism Helicobacter pylori {H. pylori), and other 
related genes, their products, and uses thereof. The nucleic acids and peptides of the 
present invention have utility for diagnostic and therapeutics for H. pylori and other 

1 5 Helicobacter species. They can also be used to detect the presence of H. pylori and 
other Helicobacter species in a sample; and for use in screening compounds for the 
ability to interfere with the H. pylori life cycle or to inhibit H. pylori infection. More 
specifically, this invention features compositions of nucleic acids corresponding to 
entire coding sequences of//, pylori proteins, including surface or secreted proteins or 

20 parts thereof, nucleic acids capable of binding mRNA from H. pylori proteins to block 
protein translation, and methods for producing H. pylori proteins or parts thereof using 
peptide synthesis and recombinant DNA techniques. This invention also features 
antibodies and nucleic acids useful as probes to detect H. pylori infection. In addition, 
vaccine compositions and methods for the protection or treatment of infection by H. 

25 pylori are within the scope of this invention. 



Detailed Description of the Drawings 

Figure 1 is a bar graph that depicts the antibody titer in serum of mice following 
immunization with specific H. pylori antigens. 
30 Figure 2 is a bar graph that depicts the antibody titer in mucous of mice 

following immunization with specific H. pylori antigens. 

Figure 3 is a bar graph that depicts therapeutic immunization of H. pylori 
infected mice with specific antigens dissolved in HEPES buffer. 

Figure 4 is a bar graph that depicts therapeutic immunization of H. pylori 
35 infected mice with specific antigens dissolved in buffer containing DOC. 
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Figure 5 depicts the amino acid sequence alignment in a portion of the sequence 
of five K pylori proteins (depicted in the single letter amino acid code; shown N- 
terminal to C-terminal, left to right). 

Figure 6 depicts the amino acid sequence alignment in a portion of the sequence 
5 of four K pylori proteins (depicted in the single letter amino acid code; shown N- 
terminal to C-terminal, left to right). 

Figure 7 depicts the amino acid sequence alignment in a portion of the sequence 
of two K pylori proteins (depicted in the single letter amino acid code; shown N- 
terminal to C-terminal, left to right). 
10 Figure 8 depicts the amino acid sequence alignment in a portion of the sequence 

of two H^py/or? proteins (depicted in the single letter amino acid code; shown N- 
terminal to C-terminal, left to right). 

Detailed Description of the Invention 

15 In one aspect, the invention features a recombinant or substantially pure 

preparation of//, pylori polypeptide of SEQ ID NO: 74. The invention also includes 
substantially pure nucleic acid encoding an //. pylori polypeptide of SEQ ID NO: 74, 
such nucleic acid is contained in SEQ ID NO: 1. The K pylori polypeptide sequences 
of the invention described herein are contained in the Sequence Listing, and the nucleic 

20 acids encoding H. pylori polypeptides of the invention are contained in the Sequence 
Listing. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 75, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 2. 
25 In another aspect, the invention features a substantially pure nucleic acid 

encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 76, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO; 3. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 77, 
30 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 4. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 78, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 5. 

In another aspect, the invention features a substantially pure nucleic acid 
35 encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 79, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 6. 
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In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 80, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 7. 

In another aspect, the invention features a substantially pure nucleic acid 
5 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 8 1 , 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 8. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 82, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 9. 
0 In another aspect, the invention features a substantially pure nucleic acid 

encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 83, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 10. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. ;7y/or/ polypeptide having an amino acid sequence of SEQ ID NO: 84, 
5 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 1 . 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 85, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 12. 

In anodier aspect, the invention features a substantially pure nucleic acid 
20 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 86, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 13. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 87, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 14. 
25 In another aspect, the invention features a substantially pure nucleic acid 

encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 88, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 5. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 89, 
30 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 6. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 90, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 17. 

In another aspect, the invention features a substantially pure nucleic acid 
35 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 9 1 , 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 18. 
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In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 92, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 19. 

In another aspect, the invention features a substantially pure nucleic acid 
5 encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 93, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 20. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 94, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 21 . 
10 In another aspect, the invention features a substantially pure nucleic acid 

encoding ^ H^p^y/or/ polypeptide^^ an amino acid sequence of SEQ ID NO: 95, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 22, 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 96, 
15 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 23. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 97, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 24. 

In another aspect, the invention features a substantially pure nucleic acid 
20 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 98, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 25. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 99, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 26. 
25 In another aspect, the invention features a substantially pure nucleic acid 

encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 100, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 27. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 101, 
30 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 28, 

In another aspect, the invention features a substantially pure nucleic acid 
encoding H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 102, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 29. 

In another aspect, the invention features a substantially pure nucleic acid 
35 encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 103, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 30. 
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In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 104, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 31, 

In another aspect, the invention features a substantially pure nucleic acid 
5 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 105, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 32. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 106, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 33. 
10 In another aspect, the invention features a substantially pure nucleic acid 

encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 107, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 34, 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 108, 
15 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 35. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 109, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 36. 

In another aspect, the invention features a substantially pure nucleic acid 
20 encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 110, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 37. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 111, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 38. 
25 In another aspect, the invention features a substantially pure nucleic acid 

encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 12, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 39. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 113, 
30 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 40. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 14, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 41. 

In another aspect, the invention features a substantially pure nucleic acid 
35 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 115, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 42. 
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In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 116, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO:43. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 117, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 44. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 118, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 45. 

In another aspect, the invention features a substantially pure nucleic acid 
"encoding an //.^p57/on^lypeptiae"Kaving an amino acid sequence of SEQ ID NO: 1 19, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 46. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 120, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 47. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 121, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 48. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 122, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 49. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 123, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO; 50. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 124, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 51. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 125, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 52. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 126, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 53. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 127, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 54. 
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In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 128, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 55, 

In another aspect, the invention features a substantially pure nucleic acid 
5 encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 129, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 56. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 130, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 57. 
10 In another aspect, the invention features a substantially pure nucleic acid 

encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 131, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 58. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 132, 
15 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 59. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an //. pylori polypeptide having an amino acid sequence of SEQ ID NO: 133, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 60. 

In another aspect, the invention features a substantially pure nucleic acid 
20 encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 134, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 61. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 135, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 62. 
25 In another aspect, the invention features a substantially pure nucleic acid 

encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 136, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 63. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 137, 
30 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 64. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 138, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 65. 

In another aspect, the invention features a substantially pure nucleic acid 
35 encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 139, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 66. 
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In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 140, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 67. 

In another aspect, the invention features a substantially pure nucleic acid 
5 encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 141, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 68. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 142, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 69. 
10 In another aspect, the invention features a substantially pure nucleic acid 

encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 143, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 70. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 144, 
15 such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 71. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 145, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 72. 

In another aspect, the invention features a substantially pure nucleic acid 
20 encoding an H, pylori polypeptide having an amino acid sequence of SEQ ID NO: 146, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 73. 

Particularly perferred is an isolated nucleic acid comprising a nucleotide 
sequence encoding an H, pylori cell envelope polypeptide or a fragment thereof. Such 
nucleic acid is selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 25, 
25 SEQ ID NO: 48, SEQ ID NO: 1 6, SEQ ID NO: 10, SEQ ID NO: 45, SEQ ID NO: 35, 
SEQ ID NO: 37, SEQ ID NO: 7, SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 18, 
SEQ ID NO: 19, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, 
SEQ ID NO: 56, SEQ ID NO: 58, SEQ ID NO: 1, SEQ ID NO: 42, SEQ ID NO: 14, 
SEQ ID NO: 43, SEQ ID NO: 1 1, SEQ ID NO: 71, SEQ ID NO: 17, SEQ ID NO: 57, 
30 SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, and SEQ ID NO: 2 1 . 

In another embodiment, the K pylori cell envelope polypeptide or a fragment 
thereof is an //. pylori inner membrane polypeptide or a fragment thereof encoded by the 
nucleic acid selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 25, and 
SEQ ID NO: 48. 

35 In another embodiment, the H. pylori cell envelope polypeptide or a fragment 

thereof is an H. pylori outer membrane polypeptide or a fragment thereof encoded by the 
nucleic acid selected from the group consisting of SEQ ID NO: 16, SEQ ID NO: 10, 
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SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 7, SEQ ID NO: 39, 
SEQ ID NO: 55, SEQ ID NO: 1 8, SEQ ID NO: 19, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, SEQ ID NO: 58, SEQ ID NO: 1, 
SEQ ID NO: 42, SEQ ID NO: 14, SEQ ID NO: 43, SEQ ID NO: 1 1, and SEQ ID NO: 
5 71. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C- 
terminal tyrosine cluster or a fragment thereof encoded by the nucleic acid selected from 
the group consisting of SEQ ID NO: 1, SEQ ID NO: 42, SEQ ID NO: 14, SEQ ID NO: 

10 43, SEQ ID NO: 11, and SEQ ID NO:71. 

In yet another embodiment, the H. pylori outer membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue or 
a fragment thereof encoded by the nucleic acid selected from the group consisting of 
SEQ ID NO: 16, SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 7, 

15 SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 28, 
SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, and SEQ ID NO: 
58. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide 
sequence encoding an H. pylori secreted polypeptide or a fragment thereof Such 

20 nucleic acid is selected from the group consisting of SEQ ID NO: 72, SEQ ID NO: 32, 
SEQ ID NO: 51, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 9, SEQ ID NO: 13, SEQ 
ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 3 1, SEQ ID NO: 33, SEQ ID NO: 34, SEQ 
ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 40. SEQ ID NO: 41, SEQ ID NO: 44, SEQ 
ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 53, SEQ ID NO: 59, SEQ ID NO: 61, SEQ 

25 ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, and 
SEQ ID NO: 68. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide 
sequence encoding an H. pylori cellular polypeptide or a fragment thereof Such nucleic 
acid is selected from the group consisting of SEQ ID NO: 12, SEQ ID NO: 1 5, SEQ ID 

30 NO: 20, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID 
NO: 47, SEQ ID NO: 50, SEQ ID NO: 60, SEQ ID NO: 64, SEQ ID NO: 69, SEQ ID 
NO: 70, and SEQ ID NO: 73. 

Particularly preferred is a purified or isolated H. pylori cell envelope polypeptide 
or a fragment thereof, wherein the polypeptide is selected from the group consisting of 

35 SEQ ID NO: 76, SEQ ID NO: 98, SEQ ID NO: 121, SEQ ID NO: 89, SEQ ID NO: 83, 
SEQ ID NO: 118, SEQ ID NO: 108, SEQ ID NO: 110, SEQ ID NO: 80, SEQ ID NO: 
1 12, SEQ ID NO: 128, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 101, SEQ ID 
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NO: 103, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131, SEQ 
ID NO: 74, SEQ ID NO: 115, SEQ ID NO: 87, SEQ ID NO: 1 1 6, SEQ ID NO: 84, SEQ 
ID NO: 144, SEQ ID NO: 90, SEQ ID NO: 130, SEQ ID NO: 78, SEQ ID NO: 79, SEQ 
ID NO: 81, and SEQ ID NO: 94. 
5 In another embodiment, the H. pylori cell envelope polypeptide or a fragment 

thereof is an H pylori inner membrane polypeptide or a fragment thereof selected from 
the group consisting of SEQ ID NO: 76, SEQ ID NO: 98, and SEQ ID NO: 121. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof selected from 

10 the group consisting of SEQ ID NO: 89, SEQ ID NO: 83, SEQ ID NO: 1 1 8, SEQ ID 
NO: 108, SEQ ID NO: 110, SEQ ID NO:"80, SEQ ID NO: ri2,"SEQ ID NOri28, SEQ 
ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 125, 
SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131, SEQ ID NO: 74, SEQ ID NO: 
115, SEQ ID NO: 87, SEQ ID NO: 1 16, SEQ ID NO: 84, SEQ ID NO: 144, SEQ ID 

1 5 NO: 90, and SEQ ID NO: 130. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C- 
terminal tyrosine cluster or a fragment thereof selected from the group consisting of 
SEQ ID NO: 74, SEQ ID NO: 115, SEQ ID NO: 87, SEQ ID NO: 1 16, SEQ ID NO: 84, 

20 and SEQ ID NO: 144. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a 
fragment thereof selected from the group consisting of SEQ ID NO: 89. SEQ ID NO: 
1 18, SEQ ID NO: 108, SEQ ID NO: 1 10, SEQ ID NO: 80, SEQ ID NO: 1 12, SEQ ID 

25 NO: 1 28, SEQ ID NO: 9 1 , SEQ ID NO: 92, SEQ ID NO: 1 0 1 , SEQ ID NO: 1 03, SEQ 
ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131. 

Particularly preferred is a purified or isolated H. pylori secreted polypeptide or a 
fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 145, SEQ ID NO: 105, SEQ ID NO: 124, SEQ ID NO: 75, SEQ ID NO: 77, 

30 SEQ ID NO: 82, SEQ ID NO: 86, SEQ ID NO: 95, SEQ ID NO: 102, SEQ ID NO: 104, 
SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 1 1 1, SEQ ID NO: 
113, SEQ ID NO: 114, SEQ ID NO: 117, SEQ ID NO: 119, SEQ ID NO: 122, SEQ ID 
NO: 126, SEQ ID NO: 132, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ 
ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, and SEQ ID NO: 141. 

35 P2irticularly preferred is a purified or isolated H. pylori cellular polypeptide or a 

fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 85, SEQ ID NO: 88, SEQ ID NO: 93, SEQ ID NO: 96, SEQ ID NO: 97, SEQ 
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ID NO: 99, SEQ ID NO: 100. SEQ ID NO: 120, SEQ ID NO: 123, SEQ ID NO: 133, 
SEQ ID NO: 137, SEQ ID NO: 142, SEQ ID NO: 143, and SEQ ID NO: 146. 

In another aspect, the invention pertains to any individual H. pylori polypeptide 
member or nucleic acid encoding such a member from the above-identified groups of K 

5 pylori polypeptides. 

In another aspect, the invention features nucleic acids capable of binding mRNA 
of H, pylori. Such nucleic acid is capable of acting as antisense nucleic acid to control 
the translation of mRNA of//, pylori. A further aspect features a nucleic acid which is 
capable of binding specifically to an H. pylori nucleic acid. These nucleic acids are also 
10 referred to herein as complements and have utility as probes and as capture reagents. 

In another aspect, the invention features an expression system comprising an 
open reading frame corresponding to K pylori nucleic acid. The nucleic acid further 
comprises a control sequence compatible v^ith an intended host. The expression system 
is useful for making polypeptides corresponding to H. pylori nucleic acid. 
15 In another aspect, the invention features a cell transformed with the expression 

system to produce H. pylori polypeptides. 

In another aspect, the invention features a method of generating antibodies 
against H. pylori polypeptides which are capable of binding specifically to H. pylori 
polypeptides. Such antibodies have utility as reagents for immunoassays to evaluate the 
20 abundance and distribution of H. pylori-specific antigens. 

In another aspect, the invention features a method of generating vaccines for 
immunizing an individual against K pylori. The vaccination method includes: 
immunizing a subject with at least one H. pylori polypeptide according to the present 
invention, e.g., a surface or secreted polypeptide, or active portion thereof, and a 
25 pharmaceutically acceptable carrier. Such vaccines have therapeutic and/or prophylactic 
utilities. 

In another aspect, the invention provides a method for generating a vaccine 
comprising a modified immunogenic K pylori polypeptide, e.g., a surface or secreted 
polypeptide, or active portion thereof, and a pharmacologically acceptable earner. 

30 In another aspect, the invention features a method of evaluating a compound, e.g. 

a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an K 
pylori polypeptide. The method includes: contacting the candidate compound with an 
K pylori polypeptide and determining if the compound binds or otherwise interacts with 
an H. pylori polypeptide. Compounds which bind K pylori are candidates as activators 

35 or inhibitors of the bacterial life cycle. These assays can be performed in vitro or in 
vivo. 
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In another aspect, the invention features a method of evaluating a compound, e.g. 
a polypeptide, e.g., a fragment of a host ceil polypeptide, for the ability to bind an H. 
pylori nucleic acid, e.g., DNA or RNA. The method includes: contacting the candidate 
compound with an K pylori nucleic acid and determining if the compound binds or 
otherwise interacts with an K pylori polypeptide. Compounds which bind K pylori are 
candidates as activators or inhibitors of the bacterial life cycle. These assays can be 
performed in vitro or in vivo. 

The invention features //. pylori polypeptides, preferably a substantially pure 
preparation of an H. pylori polypeptide, or a recombinant H. pylori polypeptide. In 
preferred embodiments: the polypeptide has biological activity; the polypeptide has an 
amino acid-sequence at-least 60%770%-,-80yor90%,-95"^^^^^ 

homologous to an amino acid sequence of the invention contained in the Sequence 
Listing, preferably it has about 65% sequence identity with an amino acid sequence of 
the invention contained in the Sequence Listing, and most preferably it has about 92% to 
about 99% sequence identity with an amino acid sequence of the invention contained in 
the Sequence Listing; the polypeptide has an amino acid sequence essentially the same 
as an amino acid sequence of the invention contained in the Sequence Listing; the 
polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acid residues in length; the 
polypeptide includes at least 5, preferably at least 10, more preferably at least 20, more 
preferably at least 50, 100, or 150 contiguous amino acid residues of the invention 
contained in the Sequence Listing. In yet another preferred embodiment, the amino acid 
sequence which differs in sequence identity by about 7% to about 8% from the K pylori 
amino acid sequences of the invention contained in the Sequence Listing is also 
encompassed by the invention. 

In preferred embodiments: the K pylori polypeptide is encoded by a nucleic 
acid of the invention contained in the Sequence Listing, or by a nucleic acid having at 
least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid of the 
invention contained in the Sequence Listing, 

In a preferred embodiment, the subject H. pylori polypeptide differs in amino 
acid sequence at 1 , 2, 3, 5, 1 0 or more residues from a sequence of the invention 
contained in the Sequence Listing. The differences, however, are such that the K pylori 
polypeptide exhibits an K pylori biological activity, e.g., the K pylori polypeptide 
retains a biological activity of a naturally occurring H. pylori polypeptide. 

In preferred embodiments, the polypeptide includes all or a fragment of an amino 
acid sequence of the invention contained in the Sequence Listing; fused, in reading 
frame, to additional amino acid residues, preferably to residues encoded by genomic 
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DNA 5' or 3' to the genomic DNA which encodes a sequence of the invention contained 

in the Sequence Listing. 

In yet other preferred embodiments, the H. pylori polypeptide is a recombmant 
fusion protein having a first H. pylori polypeptide portion and a second polypeptide 
5 portion e.g., a second polypeptide portion having an amino acid sequence umrelated to 
H. pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, 
a DNA binding domain, or a polymerase activating domain. In preferred embodmient 
the fusion protein can be used in a two-hybrid assay. 

Polypeptides of the invention include those which arise as a result of alternative 
10 transcription events, alternative RNA splicing events, and alternative translational and 

postranslational events. 

The invention also encompasses an immunogenic component which includes at 
least one H. pylori polypeptide in an immunogenic preparation; the immunogenic 
component being capable of eliciting an immune response specific for the H. pylori 

1 5 polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In 
preferred embodiments, the immunogenic component comprises at least one antigemc 
determinant from a polypeptide of the invention contained in the Sequence Listing. 

In another aspect, the invention provides a substantially pure nucleic acid having 
a nucleotide sequence which encodes an H. pylori polypeptide. In preferred 

20 embodiments: the encoded polypeptide has biological activity; the encoded polypeptide 
has an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% 
homologous to an amino acid sequence of the invention contained in the Sequence 
Listing; the encoded polypeptide has an amino acid sequence essentially the same as an 
amino acid sequence of the invention contained in the Sequence Listing; the encoded 

25 polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded 

polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more 
preferably at least 50, 100, or 150 contiguous amino acids of the invention contained in 

the Sequence Listing. 

In preferred embodiments: the nucleic acid of the invention is that contained in 
30 the Sequence Listing; the nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or 
99% homologous with a nucleic acid sequence of the invention contained m the 
Sequence Listing. 

In a preferred embodiment, the encoded H. pylori polypeptide differs (e.g., by 
amino acid substitution, addition or deletion of at least one amino acid residue) in amino 
35 acid sequence at I, 2, 3, 5, 10 or more residues, from a sequence of the invention 
contained in the Sequence Listing. The differences, however, are such that: the H. 
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pylori encoded polypeptide exhibits a K pylori biological activity, e.g., the encoded H, 
pylori enzyme retains a biological activity of a naturally occurring H, pylori. 

In preferred embodiments, the encoded polypeptide includes all or a fragment of 
an amino acid sequence of the invention contained in the Sequence Listing; fused, in 
5 reading frame, to additional amino acid residues, preferably to residues encoded by 
genomic DNA 5' or 3* to the genomic DNA which encodes a sequence of the invention 
contained in the Sequence Listing. 

In preferred embodiments, the subject H, pylori nucleic acid will include a 
transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or 
10 transcriptional enhancer sequence, operably linked to the H. pylori gene sequence, e.g., 
to render the K pylori gene sequence suitable for expression in a recombinant host cell. 

In yet a further preferred embodiment the nucleic acid which encodes an H. 
pylori polypeptide of the invention, hybridizes under stringent conditions to a nucleic 
acid probe corresponding to at least 8 consecutive nucleotides of the invention contained 
15 in the Sequence Listing; more preferably to at least 12 consecutive nucleotides of the 
invention contained in the Sequence Listing; more preferably to at least 20 consecutive 
nucleotides of the invention contained in the Sequence Listing; more preferably to at 
least 40 consecutive nucleotides of the invention contained in the Sequence Listing. 

In a preferred embodiment, the nucleic acid encodes a peptide which differs by at 
20 least one amino acid residue from the sequences of the invention contained in the 
Sequence Listing. 

In a preferred embodiment, the nucleic acid differs by at least one nucleotide 
from a nucleotide sequence of the invention contained in the Sequence Listing which 
encodes amino acids of the invention contained in the Sequence Listing. 

25 In another aspect, the invention encompasses: a vector including a nucleic acid 

which encodes an H, pylori polypeptide or an H. pylori polypeptide variant as described 
herein; a host cell transfected with the vector; and a method of producing a recombinant 
H. pylori polypeptide or //. pylori polypeptide variant; including culturing the cell, e.g., 
in a cell culture medium, and isolating the K pylori or K pylori polypeptide variant, 

30 e.g., from the cell or from the cell culture medium. 

In another aspect, the invention features, a purified recombinant nucleic acid 
having at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a 
sequence of the invention contained in the Sequence Listing. 

The invention also provides a probe or primer which includes a substantially 

35 purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence 
which hybridizes under stringent conditions to at least 8 consecutive nucleotides of 
sense or antisense sequence of the invention contained in the Sequence Listing, or 
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naturally occurring mutants thereof. In preferred embodiments, the probe or primer 
further includes a label group attached thereto. The label group can be, e.g., a 
radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor. 
Preferably the oligonucleotide is at least 8 and less than 10, 20, 30, 50, 100, or 150 
5 nucleotides in length. 

The invention also provides an isolated K pylori polypeptide which is encoded 
by a nucleic acid which hybridizes under stringent hybridization conditions to a nucleic 
acid contained in the Sequence Listing. 

The invention further provides nucleic acids, e.g., RNA or DNA, encoding a 
10 polypeptide of the invention. This includes double stranded nucleic acids as well as 
coding and antisense single strands. 

The K pylori strain, from which genomic sequences have been sequenced, has 
been deposited in the American Type Culture Collection (ATCC # 55679; deposited by 
Genome Therapeutics Corporation, 100 Beaver Street, Waltham, MA 02154) as strain 
15 HP-J99. 

Included in the invention are: allelic variations; natural mutants; induced 
mutants; proteins encoded by DNA that hybridizes under high or low stringency 
conditions to a nucleic acid which encodes a polypeptide of the invention contained in 
the Sequence Listing (for definitions of high and low stringency see Current Protocols in 

20 Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6 and 6.4.1-6.4.10, 
hereby incorporated by reference); and, polypeptides specifically bound by antisera to K 
pylori polypeptides, especially by antisera to an active site or binding domain of//. 
pylori polypeptide. The invention also includes fragments, preferably biologically 
active fragments. These and other polypeptides are also referred to herein as K pylori 

25 polypeptide analogs or variants. 

Putative functions have been determined for several of the H. pylori polypeptides 
of the invention, as shown in Table 1. 

Accordingly, uses of the claimed K pylori polypeptides based on these identified 
functions, as well as other functions as described herein, are also within the scope of the 

30 invention. 

In addition, the present invention encompasses // pylori polypeptides 
characterized as shown in Table 1 below, including: H. pylori cell envelope proteins, K 
pylori secreted proteins, and K pylori cellular proteins. Members of these groups were 
identified by BLAST homology searches and by searches for secretion signal or 
3 5 transmembrane protein motifs. Polypeptides related by significant homology to the 
polypeptides of Table 1 are also considered to be classified in the manner of the 
homologs shown in Table 1 . 
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TABLE 1 



ORF_Name and Group 


nt SeqID 


aa SeqID 


A. CELL ENVELOPE 






A.1 Inner membrane proteins 






02ge1 1622_23494043_f1_6 


3 


76 


hp5p1 521 2_1 3095752_c3_36 


25 


98 


06ep30223_20173437_f1_37 


48 


121 


A.2 Outer membrane proteins 






05ee1 081 6_1 4495437__f2_1 3 


10 


83 


A.2,1 Terminal phe residue 






06ep1i 509~359b4 /b:^_t2_1 


16 


89 


06ep10615 14495437_f3_47 


45 


118 


03ae10804 14495437 c2_38 


35 


108 


05ae30220_91 7200_c3_1 72 


37 


110 


04cp 1 1^Uz^Z0b4DO<30_rz__^D 


7 


OVJ 


uoep I uoi o_ 1 o 1 oi y4tD__^cz_y / 




1 19 


U^CpolvJUO OoDUO/ '_Ti£_Z*5 




19fl 


u^ae 1 uo 1 z^^o f Do co^D f 


1 A 
1 0 


Q1 

*3 1 


nQ^i-*i 1 noQ *^RRnfl77 fi v 
uycpn iuuo ooouo/ / ^^^^f 


1Q 


q? 


npoe iZido/ ou*t f oDo^ lo oo 


9fl 


101 












125 


nQr*nRinn'^ '^n47R'^fi9 r'^ lOfi 




127 


llaoftnftlft in'^^'^1Q9 f9 1fi 




129 


1l£a#a114nft in*^R4'SR9 r*^ 


58 


131 


A 9 9 X^rminal nh^ r^^ifiup anrl 
^^.^.^ 1 d 1 1 III loi ^1 ic 1 coiuuc ai ivj w 

terminal tyrosine cluster 






0 1 ae 1 200 1 _1 1 60 1 B_c2_40 


1 


74 


06ap10609_1 16018_c3_50 


42 


115 


06cp30603_4687507_f1_9 


14 


87 


06cp30603_4687507_f1_7 


43 


116 


05ee1 08 1 6_361 26938_f3_1 6 


11 


84 


01cp20708_4960952_c1_43 


71 


144 


A.3 Via homolgy 






07ap80601.50831 93_f3_8 


17 


90 


11ap20714_4797137_f3_45 


57 


130 


A.4 Other cell envelope proteins 






04ap 1 201 6_25501 50 1 _f 1_1 


5 


78 


04cp1 1202_20415937_f2_25 


6 


79 


04ee1 1 108_3906963_f1_7 


8 


81 


29ep1 0720^25501 50 1_c2_33 


21 


94 


B. SECRETED PROTEINS 






hp3e1 0342_22448587_c2_1 5 


72 


145 


hp5p15212_24276587_f1_2 


32 


105 


09ce1 041 3_35336707_f2_9 


51 


124 
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01ae12001_32462543_c2_43 


o 
Z 


f o 


03ee 1121 5_1 41631 2_c3_35 


A 
H 


77 


05ae30220_1 4570443_c2_94 


Q 




06cp30603_2772578_c1_46 


1 O 


Ou 


29ep 1 0720_289077_f2_1 2 




yo 


03ee1 1215_22542803_f1_7 


zy 


1 


09ae10512_3166040_c1_40 




1 HA 


01ce1 1 104_10742963_c2_12 


33 


1 UD 


02ge1 01 1 6_36335436_f3_66 


34 




04ep41903_1 1876461_f1_4 


36 




05ce10208_23631292_f1_6 


38 


'1 4 

m 


OSepI 081 5_22447252_c3_1 1 0 


40 


113 


05ep1 081 5_3028351 6__c3_1 09 


41 


■4 «t >1 

114 


06ee30709_33851038_c3_30 


44 


11 / 


06ep1 1202_21687842.c3_35 


46 


119 


06ep30223_2774062 J1 _33 


49 


122 


09cp 1 07 1 3_239 1 2707_c1 _26 


53 


12d 


Heel 1408_488231 8_f3_24 


59 


132 


hp4e1 3394_5908553 J1_1 


61 


134 


hp4e53394_1416312_c3_119 


62 


'toe 

135 


hp5e1 52 1 1 _243289 1 0_c3_38 


63 


13o 


hp6p10606_23493756_c1_21 


65 


loo 


hp6p222 1 7_235640 1 2J 1 _5 


66 


1 39 


hp6p22217_272058J1_2 


67 


140 


h p6p222 1 7.2922 1 43_f2_9 


68 


•1 >i 
141 


C. OTHER CELLULAR PROTEINS 






06ap1 1 1 19_14726542_f3_21 


12 


oD 


06ee1 0709^61 36430__c1„1 1 


15 


QR 

oo 


12ap10605_14094816_c1_5 


20 


y»j 


hp2p1 0272_3404261 8_f1_2 


23 


yo 


hp5e1 521 1_2541 1 557_c1_22 


24 


Q7 

y f 


hn*^n1'^R41 '^^07968 f1 3 


26 


99 


hp6e1 0967_657638_f3_9 


27 


100 


06ep1 1202_4569693_c2_28 


47 


120 


06ep30223_3930468_c1_1 10 


50 


123 


hp2e1091 1_960952_c2_86 


60 


133 


hp6p1 0509^1 464221 7_c2_1 7 


64 


137 


hp6p80503_20964382 J2_1 1 


69 


142 


hp7e1 01 92.591 7593_f 1_2 


70 


143 


hp6p1 0509.1464221 7.c3_25 


73 


146 



[In Table 1, "nt" represents nucleotide Seq. ID number and "aa" represents amino acid 
Seq. ID number] 

5 Definitions 

The terms "purified polypeptide" and "isolated polypeptide" and "a substantially 
pure preparation of a polypeptide" are used interchangeably herein and, as used herein. 
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from other proteins, lipids, and nucleic acids with which it naturally occurs. Preferably, 
the polypeptide is also separated from substances, e.g., antibodies or gel matrix, e.g., 
polyacrylamide, which are used to purify it. Preferably, the polypeptide constitutes at 
least 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. Preferably, the 
5 preparation contains: sufficient polypeptide to allow protein sequencing; at least 1,10, 
or 100 ixg of the polypeptide; at least 1, 10, or 100 mg of the polypeptide. Furthermore, 
the terms "purified polypeptide" and "isolated polypeptide" and "a substantially pure 
preparation of a polypeptide," as used herein, refer to both a polypeptide obtained from 
nature or produced by recombinant DNA techniques as described herein. 

10 For example, an "isolated" or "purified" protein or biologically active portion 

thereof is substantially free of cellular material or other contaminating proteins from the 
cell or tissue source from which the H. pylori protein is derived, or substantially free 
from chemical precursors or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" includes prepzirations ofK pylori 

1 5 protein in which the protein is separated from cellular components of the cells from 
which it is isolated or recombinantly produced. In one embodiment, the language 
"substantially free of cellular material" includes preparations of H. pylori protein having 
less than about 30% (by dry weight) of non-//. pylori protein (also referred to herein as a 
"contaminating protein"), more preferably less than about 20% of non-/f. pylori protein, 

20 still more preferably less than about 10% of non-//. pylori protein, and most preferably 
less than about 5% non-//. pylori protein. When the // pylori protein or biologically 
active portion thereof is recombinantly produced, it is also preferably substantially free 
of culture medium, i.e., culture medium represents less than about 20%, more preferably 
less than about 10%, and most preferably less than about 5% of the volume of the 

25 protein prepeiration. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of K pylori protein in which the protein is separated from 
chemical precusors or other chemicals which are involved in the synthesis of the protein. 
In one embodiment, the language "substantially free of chemiczd precursors or other 

30 chemicals" includes preparations of H. pylori protein having less than about 30% (by dry 
weight) of chemical precursors or non-// pylori chemicals, more preferably less than 
about 20% chemical precursors or non-// pylori chemicals, still more preferably less 
than about 1 0% chemical precursors or non-// pylori chemicals, and most preferably 
less than about 5% chemical precursors or non-// pylori chemicals. 

35 A purified preparation of cells refers to, in the case of plant or animal cells, an in 

vitro preparation of cells and not an entire intact plant or animal. In the case of cultured 
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cells or microbial cells, it consists of a preparation of at least 10% and more preferably 
50% of the subject cells. 

A purified or isolated or a substantially pure nucleic acid, e.g., a substantially 
pure DNA, (are terms used interchangeably herein) is a nucleic acid which is one or both 
5 of the following: not immediately contiguous with both of the coding sequences with 
which it is immediately contiguous (i.e., one at the 5* end and one at the 3' end) in the 
naturally-occurring genome of the organism from which the nucleic acid is derived; or 
which is substantially free of a nucleic acid with which it occurs in the organism from 
which the nucleic acid is derived. The term includes, for example, a recombinant DNA 

10 which is incorporated into a vector, e.g., into an autonomously replicating plasmid or 
virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a 
separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PGR or 
restriction endonuclease treatment) independent of other DNA sequences. Substantially 
pure DNA also includes a recombinant DNA which is part of a hybrid gene encoding 

1 5 additional H, pylori DNA sequence. 

A "contig" as used herein is a nucleic acid representing a continuous stretch of 
genomic sequence of an organism. 

An "open reading frame", also referred to herein as ORF, is a region of nucleic 
acid which encodes a polypeptide. This region may represent a portion of a coding 

20 sequence or a total sequence and can be determined from a stop to stop codon or from a 
start to stop codon. 

As used herein, a "coding sequence" is a nucleic acid which is transcribed into 
messenger RNA and/or translated into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
25 determined by a translation start codon at the five prime terminus and a translation stop 
code at the three prime terminus. A coding sequence can include but is not limited to 
messenger RNA, synthetic DNA, and recombinant nucleic acid sequences. 

A "complement" of a nucleic acid as used herein referes to an anti-parallel or 
antisense sequence that participates in Watson-Crick base-pairing with the original 
30 sequence. 

A "gene product" is a protein or structural RNA which is specifically encoded by 

a gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or other 
chemical entity which specifically binds to a molecule of interest. Probes are often 
35 associated with or capable of associating with a label. A label is a chemical moiety 
capable of detection. Typical labels comprise dyes, radioisotopes, luminescent and 
chemiluminescent moieties, fluorophores, enzymes, precipitating agents, amplification 
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sequences, and the like. Similarly, a nucleic acid, peptide or other chemical entity which 
specifically binds to a molecule of interest and immobilizes such molecule is referred 
herein as a "capture ligand". Capture ligands are typically associated with or capable of 
associating with a support such as nitro-cellulose, glass, nylon membranes, beads, 
5 particles and the like. The specificity of hybridization is dependent on conditions such 
as the base pair composition of the nucleotides, and the temperature and salt 
concentration of the reaction. These conditions are readily discemable to one of 
ordinary skill in the art using routine experimentation. 

Homologous refers to the sequence similarity or sequence identity between two 
1 0 polypeptides or between two nucleic acid molecules. When a position in both of the two 
compared sequences is occupied by the same base or amino acid monomer subunit, e.g., 
if a position in each of two DNA molecules is occupied by adenine, then the molecules 
are homologous at that position. The percent of homology between two sequences is a 
function of the number of matching or homologous positions shared by the two 
1 5 sequences divided by the number of positions compared x 1 00. For example, if 6 of 10 

of the positions in two sequences are matched or homologous then the two sequences are ' 
60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC 
share 50% homology. Generally, a comparison is made when two sequences are aligned 
to give maximum homology. 

20 Nucleic acids are hybridizable to each other when at least one strand of a nucleic 

acid can anneal to the other nucleic acid under defined stringency conditions. 
Stringency of hybridization is determined by: (a) the temperature at which hybridization 
and/or washing is performed; and (b) the ionic strength and polarity of the hybridization 
and washing solutions. Hybridization requires that the two nucleic acids contain 

25 complementary sequences; depending on the stringency of hybridization, however, 
mismatches may be tolerated. Typically, hybridization of two sequences at high 
stingency (such as, for example, in a solution of 0,5X SSC, at 65° C) requires that the 
sequences be essentially completely homologous. Conditions of intermediate stringency 
(such as, for example, 2X SSC at 65 ° C) and low stringency (such as, for example 2X 

30 SSC at 55° C), require correspondingly less overall complementarity between the 

hybridizing sequences. (IX SSC is 0.15 M NaCl, 0.015 M Na citrate). A preferred, 
non-limiting example of stringent hybridization conditions are hybridization in 6X 
sodium chloride/sodium citrate (SSC) at about 45^C, followed by one or more washes in 
0.2 X SSC, 0.1% SDS at 50-65OC. 

35 The terms peptides, proteins, and polypeptides are used interchangeably herein. 
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As used herein, the term '* surface protein" refers to all surface accessible 
proteins, e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and 
secreted proteins. 

A polypeptide has K pylori biological activity if it has one, two and preferably 
5 more of the following properties: (1) if when expressed in the course of an //. pylori 
infection, it can promote, or mediate the attachment of H. pylori to a cell; (2) it has an 
en2ymatic activity, structural or regulatory function characteristic of an K pylori 
protein; (3) the gene which encodes it can rescue a lethal mutation in an H. pylori gene; 
(4) or it is immunogenic in a subject. A polypeptide has biological activity if it is an 
10 antagonist, agonist, or super-agonist of a polypeptide having one of the above-listed 
properties. 

A biologically active fragment or analog is one having an in vivo or in vitro 
activity which is characteristic of the H, pylori polypeptides of the invention contained 
in the Sequence Listing, or of other naturally occurring K pylori polypeptides, e.g., one 

15 or more of the biological activities described herein. Especially preferred are fragments 
which exist in vivo, e.g., fragments which arise from post transcriptional processing or 
which arise from translation of alternatively spliced RNAs. Fragments include those 
expressed in native or endogenous cells as well as those made in expression systems, 
e.g., in CHO cells. Because peptides such as K pylori polypeptides often exhibit a 

20 range of physiological properties and because such properties may be attributable to 

different portions of the molecule, a useftil K pylori fragment or K pylori analog is one 
which exhibits a biological activity in any biological assay for K pylori activity. Most 
preferably the fragment or analog possesses 10%, preferably 40%, more preferably 60%, 
70%, 80% or 90% or greater of the activity of K pylori, in any in vivo or in vitro assay. 

25 Analogs can differ from naturally occurring K pylori polypeptides in amino acid 

sequence or in ways that do not involve sequence, or both. Non-sequence modifications 
include changes in acetylation, methylation, phosphorylation, carboxylation, or 
glycosylation. Preferred analogs include K pylori polypeptides (or biologically active 
fragments thereof) whose sequences differ from the wild-type sequence by one or more 

30 conservative amino acid substitutions or by one or more non-conservative amino acid 
substitutions, deletions, or insertions which do not substantially diminish the biological 
activity of the K pylori polypeptide. Conservative substitutions typically include the 
substitution of one amino acid for another with similar characteristics, e.g., substitutions 
within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, 

35 leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, 
arginine; and phenylalanine, tyrosine. Other conservative substitutions can be made in 
view of the table below. 
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TABLE 2 

CONSERVATIVE AMINO ACID REPLACEMENTS 



For Amino Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, He, 
D-Met, D-Ile, Om, D-Orn 


Asparagine 


N 


D-Asn, Asp, D-Asp, Glu, D-Glu, Gin, D-Gln 


"Aspartie-Aeid 


-B 


-D-AsprE>-AsnrAsn,-GlurD=GlurGInrB=GIn- 


Cysteine 


C 


D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr 


Glutamine 


Q 


D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


Glutamic Acid 


E 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, D-Gln 


Glycine 


G 


Ala, D-Ala, Pro, D-Pro, p-Ala, Acp 


Isoleucine 


I 


D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met 


Leucine 


T 


r> T „, Vol O Val T r> I <»ii \/I<»t r>-Mf»t 
u-L,c\x, vai, u-vai, i_>eu, u-i^cu, ivici, i-/-ivici 


Lysine 


K 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D- 
Met, He, D-Ile, Om, D-Om 


Methionine 


M 


D-Met, S-Me-Cys, He, D-Ile, Leu, D-Leu, Val, D-Val 


Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, 
Trans-3,4, or 5-phenylproline, cis-3,4, 
or 5-phenylproline 


Proline 


P 


D-Pro, L-I-thioazolidine-4-carboxylic acid, D-or L-1- 
oxazolidine-4-carboxylic acid 


Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), 
D-Met(O), L-Cys, D-Cys 


Threonine 


T 


b-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), 
D-Met(O), Val, D-Val 


Tyrosine 


Y 


D-Tyr, Phe, D-Phe, L-Dopa, His, D-His 


Valine 


V 


D-Val, Leu, D-Leu, He, D-Ile, Met, D-Met 
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Other analogs within the invention are those with modifications which increase 
peptide stability; such analogs may contain, for example, one or more non-peptide bonds 
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs 
that include residues other than naturally occurring L-amino acids, e.g., D-amino acids 
5 or non-naturally occurring or synthetic amino acids, e.g., (5 or y amino acids; and cyclic 
analogs. 

As used herein, the term "fragment", as applied to an K pylori analog, will 
ordinarily be at least about 20 residues, more typically at least about 40 residues, 
preferably at least about 60 residues in length. Fragments of K pylori polypeptides can 

1 0 be generated by methods known to those skilled in the art. The ability of a candidate 
fragment to exhibit a biological activity of K pylori polypeptide can be assessed by 
methods known to those skilled in the art as described herein. Also included are H. 
pylori polypeptides containing residues that are not required for biological activity of the 
peptide or that result from alternative mRNA splicing or alternative protein processing 

1 5 events. 

An "immunogenic component" as used herein is a moiety, such as an //, pylori 
polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or 
cellular immune response in a host animal alone or in combination with an adjuvant. 
An "antigenic component" as used herein is a moiety, such as an H. pylori 
20 polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody 
with sufficiently high affinity to form a detectable antigen-antibody complex. 

As used herein, the term "transgene" means a nucleic acid (encoding, e.g., one or 
more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the 
transgenic animal or cell into which it is introduced, or, is homologous to an endogenous 
25 gene of the transgenic animal or cell into which it is introduced, but which is designed to 
be inserted, or is inserted, into the cell's genome in such a way as to alter the genome of 
the cell into which it is inserted (e.g., it is inserted at a location which differs from that 
of the natural gene or its insertion results in a knockout). A transgene can include one or 
more transcriptional regulatory sequences and any other nucleic acid, such as introns, 
30 that may be necessary for optimal expression of the selected nucleic acid, all operably 
linked to the selected nucleic acid, and may include an enhancer sequence. 

As used herein, the term "transgenic cell" refers to a cell containing a transgene. 
As used herein, a "transgenic animal" is any animal in which one or more, and 
preferably essentially all, of the cells of the animal includes a transgene. The transgene 
35 can be introduced into the cell, directly or indirectly by introduction into a precursor of 
the cell, by way of deliberate genetic manipulation, such as by a process of 
transformation of competent cells or by microinjection or by infection with a 
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recombinant virus. This molecule may be integrated within a chromosome, or it may be 
extrachromosomaliy replicating DNA. 

The term "antibody" as used herein is intended to include fragments thereof 
which are specifically reactive with H. pylori polypeptides. 
5 As used herein, the term "cell-specific promoter" means a DNA sequence that 

serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue. The term also covers so-called "leaky" promoters, which 
regulate expression of a selected DNA primarily in one tissue, but cause expression in 
10 other tissues as well. 

Misexpression, as used herein, refers to a non-wild type pattern of gene 
expression. It includes: expression at non-wild type levels, i.e., over or under 
expression; a pattern of expression that differs from wild type in terms of the time or 
stage at which the gene is expressed, e.g., increased or decreased expression (as 

1 5 compared with wild type) at a predetermined developmental period or stage; a pattern of 
expression that differs from wild type in terms of decreased expression (as compared 
with wild type) in a predetermined cell type or tissue type; a pattern of expression that 
differs from wild type in terms of the splicing size, amino acid sequence, post- 
transitional modification, or biological activity of the expressed polypeptide; a pattern of 

20 expression that differs from wild type in terms of the effect of an environmental stimulus 
or extracellular stimulus on expression of the gene, e.g., a pattern of increased or 
decreased expression (as compared with wild type) in the presence of an increase or 
decrease in the strength of the stimulus. 

As used herein, "host cells" and other such terms denoting microorganisms or 

25 higher eukaryotic cell lines cultured as unicellular entities refers to cells which can 

become or have been used as recipients for a recombinant vector or other transfer DNA, 
and include the progeny of the original cell which has been transfected. It is understood 
by individuals skilled in the art that the progeny of a single parental cell may not 
necessarily be completely identical in genomic or total DNA compliment to the original 

30 parent, due to accident or deliberate mutation. 

As used herein, the term "control sequence" refers to a nucleic acid having a base 
sequence which is recognized by the host organism to effect the expression of encoded 
sequences to which they are ligated. The nature of such control sequences differs 
depending upon the host organism; in prokaryotes, such control sequences generally 

35 include a promoter, ribosomal binding site, terminators, and in some cases operators; in 
eukaryotes, generally such control sequences include promoters, terminators and in 
some instances, enhancers. The term control sequence is intended to include at a 
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minimum, all components whose presence is necessary for expression, and may also 
include additional components whose presence is advantageous, for example, leader 
sequences. 

As used herein, the term "operably linked" refers to sequences joined or ligated 
5 to function in their intended manner. For example, a control sequence is operably linked 
to coding sequence by ligation in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control sequence and host cell. 

The metabolism of a substance, as used herein, means any aspect of the, 
expression, function, action, or regulation of the substance. The metabolism of a 
1 0 substance includes modifications, e.g., covalent or non-covalent modifications of the 
substance. The metabolism of a substance includes modifications, e.g., covalent or non- 
covalent modification, the substance induces in other substances. The metabolism of a 
substance also includes changes in the distribution of the substance. The metabolism of 
a substance includes changes the substance induces in the distribution of other 
1 5 substances. 

A "sample" as used herein refers to a biological sample, such as, for example, 
tissue or fluid isloated from an individual (including without limitation plasma, serum, 
cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture 
constituents, as well as samples from the environment. 
20 The practice of the invention will employ, unless otherwise indicated, 

conventional techniques of chemistry, molecular biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of the art. Such techniques are 
explained fully in the literature. See e.g., Sambrook, Fritsch, and Maniatis, Molecular 
Cloning: Laboratory Manual 2nd ed. (1989); DNA Cloning, Volumes I and II (D.N 
25 Glover ed. 1 985); Oligonucleotide Synthesis (M.J. Gait ed, 1 984); Nucleic Acid 
Hybridization (B.D. Hames & S.J. Higgins eds. 1984); the series. Methods in 
Enzymoloqy (Academic Press, Inc.), particularly Vol. 1 54 and Vol. 155 (Wu and 
Grossman, eds.) and PCR-A Practical Approach (McPherson, Quirke, and Taylor, eds., 
1991). 



30 



I. Isolation of Nucleic Acids of H. pylori and Uses Therefor 



H. pylori Genomic Sequence 

This invention provides nucleotide sequences of the genome of//, pylori which 
35 thus comprises a DNA sequence library of H. pylori genomic DNA. The detailed 

description that follows provides nucleotide sequences of//, pylori, and also describes 
how the sequences were obtained and how ORFs and protein-coding sequences were 
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identified. Also described are methods of using the disclosed H, pylori sequences in 
methods including diagnostic and therapeutic applications. Furthermore, the library can 
be used as a database for identification and comparison of medically important 
sequences in this and other strains of K pylori, 
5 To determine the genomic sequence of H. pylori^ DNA was isolated from a strain 

of//, pylori (ATCC # 55679; deposited by Genome Therapeutics Corporation, 100 
Beaver Street, Waltham, MA 02154) and mechanically sheared by nebulization to a 
median size of 2 kb. Following size fractionation by gel electrophoresis, the fragments 
were blunt-ended, ligated to adapter oligonucleotides, and cloned into each of 20 

10 different pMPX vectors (Rice et al., abstracts of Meeting of Genome Mapping and 
' Sequencing, Cold Spring Harbor, NY, 5/1 1-5/15, 1994, p. 225) to construct a series of 
"shotgun" subclone libraries. 

DNA sequencing was achieved using multiplex sequencing procedures 
essentially as disclosed in Church et al., 1988, Science 240:185: U.S. Patents No. 

15 4,942,124 and 5,149,625). DNA was extracted from pooled cultures and subjected to 
chemical or enzymatic sequencing. Sequencing reactions were resolved by 
electrophoresis, and the products were transferred and covalently bound to nylon 
membranes. Finally, the membranes were sequentially hybridized with a series of 
labelled oligonucleotides complimentary to "tag" sequences present in the different 

20 shotgun cloning vectors. In this manner, a large number of sequences could be obtained 
from a single set of sequencing reactions. The cloning and sequencing procedures are 
described in more detail in the Exemplification. 

Individual sequence reads obtained in this manner were assembled using the 
FALCON™ program (Church et al.. 1994, Automated DNA Sequencing and Analysis, 

25 J.C. Venter, ed.. Academic Press) and PHRAP (P. Green, Abstracts of DOE Human 
Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157). The average 
contig length wfis about 3-4 kb. 

A variety of approaches are used to order the contigs so as to obtain a continuous 
sequence representing the entire K pylori genome. Synthetic oligonucleotides are 

30 designed that are complementary to sequences at the end of each contig. These 
oligonucleotides may be hybridized to libaries of H. pylori genomic DNA in, for 
example, lambda phage vectors or plasmid vectors to identify clones that contain 
sequences corresponding to the junctional regions between individual contigs. Such 
clones are then used to isolate template DNA and the same oligonucleotides are used as 

35 primers in polymerase chain reaction (PCR) to amplify junctional fragments, the 
nucleotide sequence of which is then determined. 
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The H. pylori sequences were analyzed for the presence of open reading frames 
(ORFs) comprising at least 1 80 nucleotides. As a result of the analysis of ORFs based 
on stop-to-stop codon reads, it should be understood that these ORFs may not 
correspond to the ORF of a naturally-occurring H. pylori polypeptide. These ORFs may 
5 contain start codons which indicate the initiation of protein synthesis of a naturally- 
occurring H. pylori polypeptide. Such start codons within the ORFs provided herein can 
be identified by those of ordinary skill in the relevant art, and the resulting ORF and the 
encoded H. pylori polypeptide is within the scope of this invention. For example, within 
the ORFs a codon such as AUG or GUG (encoding methionine or valine) which is part 
10 of the initiation signal for protein synthesis can be identified and the ORF modified to 
correspond to a naturally-occurring H. pylori polypeptide. The predicted coding regions 
were defined by evaluating the coding potential of such sequences with the program 
GENEMARK™ (Borodovsky and Mclninch, 1993, Comp. Chem. 17:123). 

15 Other H. pvlori Nucleic Acids 

The nucleic acids of this invention may be obtained directly from the DN A of the 
above referenced H. pylori strain by using the polymerase chain reaction (PGR). See 
"PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, 
UK, 1991) for details about the PGR. High fidelity PGR can be used to ensure a faithful 

20 DNA copy prior to expression. In addition, the authenticity of amplified products can be 
checked by conventional sequencing methods. Clones carrying the desired sequences 
described in this invention may also be obtained by screening the libraries by means of 
the PGR or by hybridization of synthetic oligonucleotide probes to filter lifts of the 
library colonies or plaques as known in the art (see, e.g., Sambrook et al.. Molecular 

25 Cloning, A Laboratory Manual 2nd edition, 1989, Cold Spring Harbor Press, NY). 

It is also possible to obtain nucleic acids encoding H. pylori polypeptides from a 
cDNA library in accordance with protocols herein described. A cDNA encoding an H. 
pylori polypeptide can be obtained by isolating total mRNA from an appropriate strain. 
Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the 

30 cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using 
any one of a number of known techniques. Genes encoding H. pylori polypeptides can 
also be cloned using established polymerase chain reaction techniques in accordance 
with the nucleotide sequence information provided by the invention. The nucleic acids 
of the invention can be DNA or RNA. Preferred nucleic acids of the invention are 

35 contained in the Sequence Listing. 

The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides 
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are known, including solid-phase synthesis which, like peptide synthesis, has been fully 
automated in commercially available DNA synthesizers (See e.g., Itakura et aL U.S. 
Patent No. 4,598,049; Caruthers et aL U.S. Patent No. 4,458,066; and Itakura U.S. 
Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 
5 Nucleic acids isolated or synthesized in accordance with features of the present 

invention are useful, by way of example, without limitation, as probes, primers, capture 
ligands, antisense genes and for developing expression systems for the synthesis of 
proteins and peptides corresponding to such sequences. As probes, primers, capture 
ligands and antisense agents, the nucleic acid normally consists of all or part 
10 (approximately twenty or more nucleotides for specificity as well as the ability to form 
staBIe~liybfidizatioh products) of the nucleic acids of the invention contained in the 
Sequence Listing. These uses are described in further detail below. 
Probes 

A nucleic acid isolated or synthesized in accordance with the sequence of the 

15 invention contained in the Sequence Listing can be used as a probe to specifically detect 
H. pylori With the sequence information set forth in the present application, sequences 
of twenty or more nucleotides are identified which provide the desired inclusivity and 
exclusivity with respect to //. pylori, and extraneous nucleic acids likely to be 
encountered during hybridization conditions. More preferably, the sequence will 

20 comprise at least twenty to thirty nucleotides to convey stability to the hybridization 
product formed between the probe and the intended target molecules. 

Sequences larger than 1 000 nucleotides in length are difficult to synthesize but 
can be generated by recombinant DNA techniques. Individuals skilled in the art will 
readily recognize that the nucleic acids, for use as probes, can be provided with a label to 

25 facilitate detection of a hybridization product. 

Nucleic acid isolated and synthesized in accordance with the sequence of the 
invention contained in the Sequence Listing can also be useful as probes to detect 
homologous regions (especially homologous genes) of other Helicobacter species using 
appropriate stringency hybridization conditions as described herein. 

30 Capture Ligand 

For use as a capture ligand, the nucleic acid selected in the marmer described 
above with respect to probes, can be readily associated with a support. The manner in 
which nucleic acid is associated with supports is well known. Nucleic acid having 
twenty or more nucleotides in a sequence of the invention contained in the Sequence 

35 Listing have utility to separate H, pylori nucleic acid from the nucleic acid of each other 
and other organisms. Nucleic acid having twenty or more nucleotides in a sequence of 
the invention contained in the Sequence Listing can also have utility to separate other 
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Helicobacter species from each other and from other organisms. Preferably, the 
sequence will comprise at least twenty nucleotides to convey stability to die 
hybridization product formed between the probe and the intended target molecules. 
Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be 
5 generated by recombinant DNA techniques. 
Primers 

Nucleic acid isolated or synthesized in accordance with the sequences described 
herein have utility as primers for the amplification of//, pylori nucleic acid. These 
nucleic acids may also have utility as primers for the amplification of nucleic acids in 

10 other Helicobacter species. With respect to polymerase chain reaction (PGR) 

techniques, nucleic acid sequences of > 10-15 nucleotides of the invention contained in 
the Sequence Listing have utility in conjunction with suitable enzymes and reagents to 
create copies of//, pylori nucleic acid. More preferably, the sequence will comprise 
twenty or more nucleotides to convey stability to the hybridization product formed 

1 5 between the primer and the intended target molecules. Binding conditions of primers 
greater than 100 nucleotides are more difficult to control to obtain specificity. High 
fidelity PGR can be used to ensure a faithful DNA copy prior to expression. In addition, 
amplified products can be checked by conventional sequencing methods. 

The copies can be used in diagnostic assays to detect specific sequences, 

20 including genes from H. pylori and/or other Helicobacter species. The copies can also 
be incorporated into cloning and expression vectors to generate polypeptides 
corresponding to the nucleic acid synthesized by PGR, as is described in greater detail 
herein. 

Antisense 

25 Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in 

accordance with the sequences described herein have utility as antisense agents to 
prevent the expression of H. pylori genes. These sequences also have utility as antisense 
agents to prevent expression of genes of other Helicobacter species. 

In one embodiment, nucleic acid or derivatives corresponding to H pylori 

30 nucleic acids is loaded into a suitable carrier such as a liposome or bacteriophage for 
introduction into bacterial cells. For example, a nucleic acid having twenty or more 
nucleotides is capable of binding to bacteria nucleic acid or bacteria messenger RNA. 
Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide 
necessary stability of a hybridization product of non-natiirally occurring nucleic acid and 

35 bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence 
greater than 1000 nucleotides in length is difficult to synthesize but can be generated by 
recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes 
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is known in the art as exemplified by U.S. Patent 4,241,046 issued December 23, 1980 
to Papahadjopoulos et al. 

II. Expression of K pylori Nucleic Acids 

Nucleic acid isolated or synthesized in accordance with the sequences described 
herein have utility to generate polypeptides. The nucleic acid of the invention 
exemplified in the Sequence Listing or fragments of said nucleic acid encoding active 
portions of H, pylori polypeptides can be cloned into suitable vectors or used to isolate 
nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and 
cloned into a suitable vector. 

The funct ion of a specific gene"or operon can be^ascertaiiSed by expression in a 
bacterial strain under conditions where the activity of the gene product(s) specified by 
the gene or operon in question can be specifically measured. Alternatively, a gene 
product may be produced in large quantities in an expressing strain for use as an antigen, 
an industrial reagent, for structural studies, etc. This expression can be accomplished in 
a mutant strain which lacks the activity of the gene to be tested, or in a strain that does 
not produce the same gene product(s). This includes, but is not limited to other 
Helicobacter strains, or other bacterial strains such as E, coli^ Norcardia^ 
Corynebacterium^ Campylobacter^ and Streptomyces species. In some cases the 
expression host will utilize the natural Helicobacter promoter whereas in others, it will 
be necessary to drive the gene with a promoter sequence derived from the expressing 
organism (e.g., an E, coli beta-galactosidase promoter for expression in E, coli). 

To express a gene product using the natural H. pylori promoter, a procedure such 
as the following can be used. A restriction fragment containing the gene of interest, 
together with its associated natural promoter element and regulatory sequences 
(identified using the DNA sequence data) is cloned into an appropriate recombinant 
plasmid containing an origin of replication that functions in the host organism and an 
appropriate selectable marker. This can be accomplished by a number of procedures 
known to those skilled in the art. It is most preferably done by cutting the plasmid and 
the fragment to be cloned with the same restriction enzyme to produce compatible ends 
that can be ligated to join the two pieces together. The recombinant plasmid is 
introduced into the host organism by, for example, electroporation and cells containing 
the recombinant plasmid are identified by selection for the marker on the plasmid. 
Expression of the desired gene product is detected using an assay specific for that gene 
product. 

In the case of a gene that requires a different promoter, the body of the gene 
(coding sequence) is specifically excised and cloned into an appropriate expression 
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plasmid. This subcloning can be done by several methods, but is most easily 
accomplished by PGR amplification of a specific fragment and ligation into an 
expression plasmid after treating the PGR product with a restriction enzyme or 
exonuclease to create suitable ends for cloning. 
5 A suitable host cell for expression of a gene can be any procaryotic or eucaryotic 

cell. For example, an H. pylori polypeptide can be expressed in bacterial cells such as E. 
coli, insect cells (baculovirus), yeast, or mammalian cells such as Ghinese hamster ovary 
cell (GHO). Other suitable host cells are known to those skilled in the art. 

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead 

1 0 to partial or complete glycosy lation and/or formation of relevant inter- or intra-chain 

disulfide bonds of a recombinant peptide product. Examples of vectors for expression in 
yeast S. cerivisae include pYepSecl (Baldari. et al., (1987) Embo J. 6:229-234), pMFa 
(Kurjan and Herskowitz, (1982) Ce// 30:933-943), pJRY88 (Schultz et al., (1987) Gene 
54: 1 13-123), and pYES2 (Invitrogen Corporation, San Diego, GA). Baculovirus vectors 

1 5 available for expression of proteins in cultured insect cells (SF 9 cells) include the pAc 
series (Smith et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow, 
V.A., and Summers, M.D., (1989) Virology 170:3 1-39). Generally, GOS cells 
(Gluzman, Y., (1981) Gell 23:175-182) are used in conjunction with such vectors as 
pGDM 8 (Arufifo, A. and Seed, B., (1987) Proc. Natl. Acad. Sci. USA 84:8573-8577) for 

20 transient amplification/expression in mammalian cells, while GHO (dhfr" Chinese 

Hamster Ovary) cells are used with vectors such as pMT2PG (Kaufman et al. (1987), 
EMBO J. 6:187-195) for stable amplification/expression in mammalian cells. Vector 
DNA can be introduced into mammalian cells via conventional techniques such as 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 

25 transfection, or electroporation. Suitable methods for transforming host cells can be 

found in Sambrook et al. ( Molecular Cloning: A La boratory Manual, 2nd Edition, Cold 
Spring Harbor Laboratory press (1989)), and other laboratory textbooks. 

Expression in procaryotes is most often carried out in E. coli with either fusion 
or non-ftision inducible expression vectors. Fusion vectors usually add a number of 

30 NH2 terminal amino acids to the expressed target gene. These NH2 terminal amino 
acids often are referred to as a reporter group. Such reporter groups usually serve two 
purposes: 1) to increase the solubility of the target recombinant protein; and 2) to aid in 
the purification of the target recombinant protein by acting as a ligand in affinity 
purification. Often, in fiision expression vectors, a proteolytic cleavage site is 

3 5 introduced at the junction of the reporter group and the target recombinant protein to 
enable separation of the target recombinant protein from the reporter group subsequent 
to purification of the fusion protein. Such enzymes, and their cognate recognition 
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sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England 
Biolabs, Beverly, MA) and pRJT5 (Pharmacia, Piscataway, NJ) which fuse glutathione 
S-transferase, maltose E binding protein, or protein A, respectively, to the target 
5 recombinant protein. A preferred reporter group is poly(His), which may be fused to the 
amino or carboxy terminus of the protein and which renders the recombinant fusion 
protein easily purifiable by metal chelate chromatography. 

Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene 
69:301-315) and pETl Id (Studier et al.. Gene Expression Technologv: Methods in 
1 0 Enzvmology 185 , Academic Press, San Diego, Califomia (1 990) 60-89). While target 
gene expression relies on host RNA polymerase transcription from the hybrid trp-lac 
fusion promoter in pTrc, expression of target genes inserted into pETl Id relies on 
transcription from the T7 gnlO-lac 0 fusion promoter mediated by coexpressed viral 
RNA polymerase (T7 gnl). This viral polymerase is supplied by host strains 

1 5 BL21(DE3) or HMS 1 74(DE3) from a resident X prophage harboring a T7 gnl under the 
transcriptional control of the lacUV 5 promoter. 

For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding an K pylori polypeptide can be cultured 
under appropriate conditions to allow expression of the polypeptide to occur. The 

20 polypeptide may be secreted and isolated from a mixture of cells and medium containing 
the peptide. Alternatively, the polypeptide may be retained cytoplasmically and the cells 
harvested, lysed and the protein isolated. A cell culture includes host cells, media and 
other byproducts. Suitable media for cell culture are well known in the art. 
Polypeptides of the invention can be isolated from cell culture medium, host cells, or 

25 both using techniques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
immunoaffinity purification with antibodies specific for such polypeptides. 
Additionally, in many situations, polypeptides can be produced by chemical cleavage of 
a native protein (e.g., tryptic digestion) and the cleavage products can then be purified 

30 by standard techniques. 

In the case of membrane bound proteins, these can be isolated from a host cell by 
contacting a membrane-associated protein fraction with a detergent forming a 
solubilized complex, where the membrane-associated protein is no longer entirely 
embedded in the membrane fraction and is solubilized at least to an extent which allows 

35 it to be chromatographically isolated from the membrane fraction. Several different 

criteria are used for choosing a detergent suitable for solubilizing these complexes. For 
example, one property considered is the ability of the detergent to solubilize the H. 
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pylori protein within the membrane fraction at minimal denaturation of the membrane- 
associated protein allowing for the activity or functionality of the membrane-associated 
protein to return upon reconstitution of the protein. Another property considered when 
selecting the detergent is the critical micelle concentration (CMC) of the detergent in 
5 that the detergent of choice preferably has a high CMC value allowing for ease of 

removal after reconstitution. A third property considered when selecting a detergent is 
the hydrophobicity of the detergent. Typically, membrane-associated proteins are very 
hydrophobic and therefore detergents which are also hydrophobic, e.g., the triton series, 
would be useful for solubilizing the hydrophobic proteins. Another property important 

10 to a detergent can be the capability of the detergent to remove the H. pylori protein with 
minimal protein-protein interaction facilitating further purification. A fifth property of 
the detergent which should be considered is the charge of the detergent. For example, if 
it is desired to use ion exchange resins in the purification process then preferably 
detergent should be an uncharged detergent. Chromatographic techniques which can be 

1 5 used in the final purification step are known in the art and include hydrophobic 
interaction, lectin affinity, ion exchange, dye affinity and inmiunoaffinity. 

One strategy to maximize recombinant H. pylori peptide expression in E. coli is 
to express the protein in a host bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, S., Gene Expre ssion Technology: Methods 

20 in Enzvmology 185 , Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy would be to alter the nucleic acid encoding an H, pylori peptide to be inserted 
into an expression vector so that the individual codons for each amino acid would be 
those preferentially utilized in highly expressed E, coli proteins (Wada et al., (1992) 
Nuc. Acids Res. 20:2 111-2118). Such alteration of nucleic acids of the invention can be 

25 carried out by standard DNA synthesis techniques. 

The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides 
are known, including solid-phase synthesis which, like peptide synthesis, has been fully 
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. 

30 Patent No. 4,598,049; Camthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. 
Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 

III. H. pylori Polypeptides 

This invention encompasses isolated H, pylori polypeptides encoded by the 
35 disclosed K pylori genomic sequences, including the polypeptides of the invention 

contained in the Sequence Listing. Polypeptides of the invention are preferably at least 
5 amino acid residues in length. Using the DNA sequence information provided herein. 



wo 98/18323 



PCTAJS97/19S75 



-35 - 

the amino acid sequences of the polypeptides encompassed by the invention can be 
deduced using methods well-known in the art. It will be understood that the sequence of 
an entire nucleic acid encoding an //. pylori polypeptide can be isolated and identified 
based on an ORP that encodes only a fragment of the cognate protein-coding region. 
5 This can be acheived, for example, by using the isolated nucleic acid encoding the ORF, 
or fragments thereof, to prime a polymerase chain reaction with genomic H. pylori DNA 
as template; this is followed by sequencing the amplified product. 

The polypeptides of the invention can be isolated from wild-type or mutant H. 
pylori cells or from heterologous organisms or cells (including, but not limited to, 
10 bacteria, yeast, insect, plant and mammalian cells) into which an H. pylori nucleic acid 
has been introduced and expressed. In addition, the polypeptides can be part of 
recombinant fusion proteins. 

H. pylori polypeptides of the invention can be chemically synthesized using 
commercially automated procedures such as those referenced herein, 

15 

IV. Identification of Nucleic Acids Encoding Vaccine Components and Targets for 
Agents Effective Against H. pylori 

The disclosed K pylori genome sequence includes segments that direct the 
synthesis of ribonucleic acids and polypeptides, as well as origins of replication, 

20 promoters, other types of regulatory sequences, and intergenic nucleic acids. The 

invention encompasses nucleic acids encoding immunogenic components of vaccines 
and targets for agents effective against H. pylori. Identification of said immunogenic 
components involved in the determination of the function of the disclosed sequences can 
be achieved using a variety of approaches. Non-limiting examples of these approaches 

25 are described briefly below. 

Homologv to known sequences: Computer-assisted comparison of the disclosed 
K pylori sequences with previously reported sequences present in publicly available 
databases is useful for identifying functional K pylori nucleic acid and polypeptide 
sequences. It will be understood that protein-coding sequences, for example, may be 

30 compared as a whole, and that a high degree of sequence homology between two 

proteins (such as, for example, >80-90%) at the amino acid level indicates that the two 
proteins also possess some degree of functional homology, such as, for example, among 
enzymes involved in metabolism, DNA synthesis, or cell wall synthesis, and proteins 
involved in transport, cell division, etc. In addition, many stractural features of 

35 particular protein classes have been identified and correlate with specific consensus 

sequences, such as, for example, binding domains for nucleotides, DNA, metal ions, and 
other small molecules; sites for covalent modifications such as phosphorylation. 
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acylation, and the like; sites of proteinrprotein interactions, etc. These consensus 
sequences may be quite short and thus may represent only a fraction of the entire 
protein-coding sequence. Identification of such a feature in an H, pylori sequence is 
therefore useful in determining the function of the encoded protein and identifying 
5 useful targets of antibacterial drugs. 

Of particular relevance to the present invention are structural features that are 
common to secretory, transmembrane, and surface proteins, including secretion signal 
peptides and hydrophobic transmembrane domains. H, pylori proteins identified as 
containing putative signal sequences and/or transmembrane domains are useful as 

1 0 immunogenic components of vaccines. 

Identification of essential genes: Nucleic acids that encode proteins essential for 
growth or viability of H. pylori are preferred drug targets. K pylori genes can be tested 
for their biological relevance to the organism by examining the effect of deleting and/or 
disrupting the genes, i.e., by so-called gene "knockout", using techniques known to 

1 5 those skilled in the relevant art. In this manner, essential genes may be identified. 

Strain-specific sequences: Because of the evolutionary relationship between 
different K pylori strains, it is believed that the presently disclosed H, pylori sequences 
are useful for identifying, and/or discriminating between, previously known and new K 
pylori strains. It is believed that other //. pylori strains will exhibit at least 70% 

20 sequence homology with the presently disclosed sequence. Systematic and routine 
analyses of DNA sequences derived from samples containing K pylori strains, and 
comparison with the present sequence allows for the identification of sequences that can 
be used to discriminate between strains, as well as those that are common to all H. pylori 
strains. In one embodiment, the invention provides nucleic acids, including probes, and 

25 peptide and polypeptide sequences that discriminate between different strains of//. 

pylori. Strain-specific components can also be identified functionally by their ability to 
elicit or react with antibodies that selectively recognize one or more H. pylori strains. 

In another embodiment, the invention provides nucleic acids, including probes, 
and peptide and polypeptide sequences that are common to all H. pylori strains but are 

30 «c»/ found in other bacterial species. 

Specific Example: Determination Of Candidate Protein Antigens For Antibody And 
Vaccine Development 

The selection of candidate protein antigens for vaccine development can be 
35 derived from the nucleic acids encoding H, pylori polypeptides. First, the ORF's can be 
analyzed for homology to other known exported or membrane proteins and analyzed 
using the discriminant analysis described by Klein, et al. (Klein, P., Kanehsia, M., and 
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DeLisi, C. (1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported 
and membrane proteins. 

Homology searches can be performed using the BLAST algorithm contained in 
the Wisconsin Sequence Analysis Package (Genetics Computer Group, University 
5 Research Park, 575 Science Drive, Madison, WI 5371 1) to compare each predicted ORF 
amino acid sequence with all sequences found in the current GenBank, SWISS-PROT 
and PIR databases. BLAST searches for local alignments between the ORF and the 
databank sequences and reports a probability score which indicates the probability of 
finding this sequence by chance in the database. ORF's with significant homology (e.g. 

10 probabilities lower than 1x10'^ that the homolo gy is onl y due to random chance) to 
membrane or exported proteins represent protein antigens for vaccine development. 
Possible functions can be provided to H. pylori genes based on sequence homology to 
genes cloned in other organisms. 

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF 

15 amino acid sequences. This algorithm uses the intrinsic information contained in the 

ORF amino acid sequence and compares it to information derived from the properties of 
known membrane and exported proteins. This comparison predicts which proteins will 
be exported, membrane associated or cytoplasmic. ORF amino acid sequences 
identified as exported or membrane associated by this algorithm are likely protein 

20 antigens for vaccine development. 

Surface exposed outer membrane proteins are likely to represent the best 
antigens to provide a protective immune response against H. pylori. Among the 
algorithms that can be used to aid in prediction of these outer membrane proteins include 
the presence of an amphipathic beta-sheet region at their C-terminus. This region which 

25 has been detected in a large number of outer membrane proteins in Gram negative 
bacteria is often characterized by hydrophobic residues (Phe or Tyr) clustered at 
alternating positions from the C-terminus (e.g., see Figure 5, block F; Figure 7, block E). 
Importantly, these sequences have not been detected at the C-termini of periplasmic 
proteins, thus allowing preliminary distinction between these classes of proteins based 

30 on primary sequence data. This phenomenon has been reported previously by Struyve et 
al. {J. Mai BioL 218:141-148, 1991). 

Also illustrated in Figure 5 are additional amino acid sequence motifs found in 
many outer membrane proteins of K pylori. The amino acid sequence alignment in 
Figure 5 depicts portions of the sequence of five //. pylori proteins (depicted in the 

35 single letter amino acid code) labeled with their amino acid Sequence ID Numbers and 
shown N-terminal to C-terminal, left to right. Five or six distinct blocks (labeled A 
through E or F) of similar amino acid residues are found including the distinctive 
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hydrophobic residues (Phe or Tyr; F or Y according to the single letter code for amino 
acid residues) frequently found at positions near the C-terminus of outer membrane 
proteins. The presence of several shared motifs clearly establishes the similarity 
between members of this group of proteins. 
5 Additional amino acid alignments for four outer membrane proteins isolated 

from K pylori are depicted in Figure 6. 

Outer membrane proteins isolated from K pylori frequently share additional 
motifs as depicted for two proteins in Figure 7 which also share the C-terminal 
hydrophobic residues, and as depicted for two proteins in Figure 8 which do not share 
10 the C-terminal hydrophobic residue motif but share a different C-terminal motif. 

One skilled in the art would know that these shared sequence motifs are highly 
significant and establish a similarity among this group of proteins. 

Infrequently it is not possible to distinguish between multiple possible 
nucleotides at a given position in the nucleic acid sequence. In those cases the 
1 5 ambiguities are denoted by an extended alphabet as follows: 

These are the official lUPAC-IUB single-letter base codes 



Code 


Bsise Description 




G 


Guanine 




A 


Adenine 




T 


Thymine 




C 


Cytosine 




R 


Purine 


(A or G) 


Y 


Pyrimidine 


(C or T or U) 


M 


Amino 


(A or C) 


K 


Ketone 


(G orT) 


S 


Strong interaction 


(C or G) 


W 


Weak interaction 


(A or T) 


H 


Not-G 


(A or C or T) 


B 


Not-A 


(C or G or T) 


V 


Not-T (not-U) 


(A or C or G) 


D 


Not-C 


(A or G or T) 


N 


Any 


(A or C or G or T) 



20 



The amino acid translations of this invention account for the ambiguity in the 
nucleic acid sequence by translating the ambiguous codon as the letter "X". In all ca5 
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the permissible amino acid residues at a position are clear from an examination of the 
nucleic acid sequence based on the standard genetic code. 

V. Production of Fragments and Analogs of H. pylori Nucleic Acids and Polypeptides 
5 Based on the discovery of the H, pylori gene products of the invention provided 

in the Sequence Lsiting, one skilled in the art can alter the disclosed structure {of K 
pylori genes), e.g., by producing fragments or analogs, and test the newly produced 
structures for activity. Examples of techniques known to those skilled in the relevant art 
which allow the production and testing of fragments and analogs are discussed below. 
10 These, or analogous methods can be used to make and screen libraries of polypeptides, 
e.g., libraries of random peptides or libraries of fragments or analogs of cellular proteins 
for the ability to bind K pylori polypeptides. Such screens are useful for the 
identification of inhibitors of//, pylori, 

15 Generation of Fragments 

Fragments of a protein can be produced in several ways, e.g., recombinantly, by 
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a 
polypeptide can be generated by removing one or more nucleotides from one end (for a 
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which 

20 encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide 
fragments. Digestion with "end-nibbling" endonucleases can thus generate DNA's 
which encode an array of fragments. DNA*s which encode fragments of a protein can 
also be generated by random shearing, restriction digestion or a combination of the 
above-discussed methods. 

25 Fragments can also be chemically synthesized using techniques known in the art 

such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 
peptides of the present invention may be arbitrarily divided into fragments of desired 
length with no overlap of the fragments, or divided into overlapping fragments of a 
desired length. 

30 

Alteration of Nucleic Acids and Polypeptides: Random Methods 

Amino acid sequence variants of a protein can be prepared by random 
mutagenesis of DNA which encodes a protein or a particular domain or region of a 
protein. Useful methods include PGR mutagenesis and saturation mutagenesis. A 
35 library of random amino acid sequence variants can also be generated by the synthesis of 
a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a 
library of variants are elsewhere herein). 
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(A) PGR Mutagenesis 

In PGR mutagenesis, reduced Taq polymerzise fidelity is used to introduce 
random mutations into a cloned fragment of DNA (Leung et ai., 1989, Technique 1:11- 
5 15). The DNA region to be mutagenized is amplified using the polymerase chain 

reaction (PGR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA 
polymerase, e.g., by using a dOTP/dATP ratio of five and adding Mn2+ to the PGR 
reaction. The pool of amplified DNA fragments are inserted into appropriate cloning 
vectors to provide random mutant libraries. 

(B) Saturation Mutagenesis 

Saturation mutagenesis allows for the rapid introduction of a large number of 
single base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 
229:242), This technique includes generation of mutations, e.g., by chemical treatment 

15 or irradiation of single-stranded DNA in vitro, and synthesis of a complimentary DNA 
strand. The mutation frequency can be modulated by modulating the severity of the 
treatment, and essentially all possible base substitutions can be obtained. Because this 
procedure does not involve a genetic selection for mutant fragments both neutral 
substitutions, as w^ell as those that alter function, are obtained. The distribution of point 

20 mutations is not biased toward conserved sequence elements. 

(O Degenerate Oligonucleotides 

A library of homologs can also be generated from a set of degenerate 
oligonucleotide sequences. Ghemical synthesis of a degenerate sequences can be carried 

25 out in an automatic DNA synthesizer, and the synthetic genes then ligated into an 

appropriate expression vector. The synthesis of degenerate oligonucleotides is known in 
the art (see for example,*Narang, SA (1983) Tetrahedron 39:3; Itakura et al, (1981) 
Recombinant DNA, Proc 3rd Cleveland Sympos, Macromolecules, ed. AG Walton, 
Amsterdam: Elsevier pp273-289; Itakura etal. (19M) Annu. Rev, Biochem, 53:323; 

30 Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1:477. Such 
techniques have been employed in the directed evolution of other proteins (see, for 
example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429- 
2433; Devlin et al. (1990) Science 249: 404-406; Gwirla et al. (1990) PNAS 87: 6378- 
6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 
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Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis 

Non-random or directed, mutagenesis techniques can be used to provide specific 
sequences or mutations in specific regions. These techniques can be used to create 
v£iriants which include, e.g., deletions, insertions, or substitutions, of residues of the 
5 known amino acid sequence of a protein. The sites for mutation can be modified 

individually or in series, e.g., by (1) substituting first with conserved amino acids and 
then with more radical choices depending upon results achieved, (2) deleting the target 
residue, or (3) inserting residues of the same or a different class adjacent to the located 
site, or combinations of options 1-3. 

10 

( A) Alanine Scanning Mutagenesis 

Alanine scanning mutagenesis is a useful method for identification of certain 
residues or regions of the desired protein that are preferred locations or domains for 
mutagenesis, Cunningham and Wells (Science 244:1081-1085, 1989). In alanine 

1 5 scanning, a residue or group of target residues are identified (e.g., charged residues such 
as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino 
acid (most preferably alanine or poly alanine). Replacement of an amino acid can affect 
the interaction of the amino acids with the surrounding aqueous environment in or 
outside the cell. Those domains demonstrating functional sensitivity to the substitutions 

20 are then refined by introducing further or other variants at or for the sites of substitution. 
Thus, while the site for introducing an amino acid sequence variation is predetermined, 
the nature of the mutation per se need not be predetermined. For example, to optimize 
the performance of a mutation at a given site, alanine scarming or random mutagenesis 
may be conducted at the target codon or region and the expressed desired protein subunit 

25 variants are screened for the optimal combination of desired activity. 

(B) Oligonucleotide-Mediated Mutagenesis 

Oligonucleotide-mediated mutagenesis is a useful method for preparing 
substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 

30 2: 1 83, 1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide 

encoding a mutation to a DNA template, where the template is the single-stranded form 
of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the 
desired protein. After hybridization, a DNA polymerase is used to synthesize an entire 
second complementary strand of the template that will thus incorporate the 

35 oligonucleotide primer, and will code for the selected alteration in the desired protein 
DNA. Generally, oligonucleotides of at least 25 nucleotides in length are used. An 
optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
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complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the single- 
stranded DNA template molecule. The oligonucleotides are readily synthesized using 
techniques known in the art such as that described by Crea et al. {Proc. Natl. Acad. Sci. 
5 USA, 75: 5765[1978]). 

fO Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the 
technique described by Wells et al. {Gene, 34:3 15 [1985]). The starting material is a 

10 plasmid (or other vector) which includes the protein subunit DNA to be mutated. The 
codon(s) in the protein subunit DNA to be mutated are identified, there must be a 
unique restriction endonuclease site on each side of the identified mutation site(s). If no 
such restriction sites exist, they may be generated using the above-described 
oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations 

15 in the desired protein subunit DNA. After the restriction sites have been introduced into 
the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded 
oligonucleotide encoding the sequence of the DNA between the restriction sites but 
containing the desired mutation(s) is synthesized using standard procedures. The two 
strands are synthesized separately and then hybridized together using standard 

20 techniques. This double-stranded oligonucleotide is referred to as the cassette. This 
cassette is designed to have 3" and 5' ends that are comparable with the ends of the 
linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now 
contains the mutated desired protein subunit DNA sequence. 

25 CD^ Combinatorial Mutagenesis 

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., 
WO 88/06630). In this method, the amino acid sequences for a group of homologs or 
other related proteins are aligned, preferably to promote the highest homology possible. 
All of the amino acids which appear at a given position of the aligned sequences can be 

30 selected to create a degenerate set of combinatorial sequences. The variegated library of 
variants is generated by combinatorial mutagenesis at the nucleic acid level, and is 
encoded by a variegated gene library. For example, a mixture of synthetic 
oligonucleotides can be enzymatically ligated into gene sequences such that the 
degenerate set of potential sequences are expressible as individual peptides, or 

3 5 alternatively, as a set of larger fusion proteins containing the set of degenerate 
sequences. 
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Other Modifications of H. pylori Nucleic Acids and Polypeptides 

It is possible to modify the structure of an H. pylori polypeptide for such 
purposes as increasing solubility, enhancing stability (e.g., shelf life ex vivo and 
resistance to proteolytic degradation in vivo). A modified H, pylori protein or peptide 
5 can be produced in which the amino acid sequence has been altered, such as by amino 
acid substitution, deletion, or addition as described herein. 

An H. pylori peptide can also be modified by substitution of cysteine residues 
preferably with alanine, serine, threonine, leucine or glutamic acid residues to minimize 
dimerization via disulfide linkages. In addition, amino acid side chains of fragments of 

10 the protein of the invention can be chemically modified. Another modification is 
cyclization of the pepfideT 

In order to enhance stability and/or reactivity, an H, pylori polypeptide can be 
modified to incorporate one or more polymorphisms in the amino acid sequence of the 
protein resulting from any natural allelic variation. Additionally, D-amino acids, non- 

1 5 natural amino acids, or non-amino acid analogs can be substituted or added to produce a 
modified protein within the scope of this invention. Furthermore, an H. pylori 
polypeptide can be modified using polyethylene glycol (PEG) according to the method 
of A. Sehon and co-workers (Wie et al., supra) to produce a protein conjugated with 
PEG. In addition, PEG can be added during chemical synthesis of the protein. Other 

20 modifications of H. pylori proteins include reduction/alky lation (Tarr, Methods of 
Protein Microcharacterization^ J. E. Silver ed., Humana Press, Clifton NJ 155-194 
(1986)); acy lation (Tarr, supra); chemical coupling to an appropriate carrier (Mishell and 
Shiigi, eds. Selected Methods in Cellular Immunology^ WH Freeman, San Francisco, CA 
(1980), U.S. Patent 4,939,239; or mild formalin treatment (Marsh, (1971) Int. Arch, of 

25 Allergy andAppl. Immunol , 4i: 1 99 - 2 1 5). 

To facilitate purification and potentially increase solubility of an H. pylori 
protein or peptide, it is possible to add an amino acid fusion moiety to the peptide 
backbone. For ex£imple, hexa-histidine can be added to the protein for purification by 
immobilized metal ion affinity chromatography (Hochuli, E. et al., (1988) 

30 Bio/Technology, 6: 1321 - 1325). In addition, to facilitate isolation of peptides free of 
irrelevant sequences, specific endoprotease cleavage sites can be introduced between the 
sequences of the fusion moiety and the peptide. 

To potentially aid proper antigen processing of epitopes within an H. pylori 
polypeptide, canonical protease sensitive sites can be engineered between regions, each 

35 comprising at least one epitope via recombinant or synthetic methods. For example, 

charged amino acid pairs, such as KK or RR, can be introduced between regions within 
a protein or fragment during recombinant construction thereof. The resulting peptide 
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can be rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes 
which would generate portions of the protein containing one or more epitopes. In 
addition, such charged amino acid residues can result in an increase in the solubility of 
the peptide. 

5 

Primary Methods for Screening Polvpep tides and Analogs 

Various techniques are known in the art for screening generated mutant gene 
products. Techniques for screening large gene libraries often include cloning the gene 
library into replicable expression vectors, transforming appropriate cells with the 

1 0 resulting library of vectors, and expressing the genes under conditions in which 

detection of a desired activity, e.g., in this case, binding to H. pylori polypeptide or an 
interacting protein, facilitates relatively easy isolation of the vector encoding the gene 
whose product was detected. Each of the techniques described below is amenable to 
high through-put analysis for screening large numbers of sequences created, e.g., by 

15 random mutagenesis techniques. 



(A) Two Hybrid Systems 

Two hybrid assays such as the system described above (as with the other 
screening methods described herein), can be used to identify polypeptides, e.g., 

20 fragments or analogs of a naturally-occurring H. pylori polypeptide, e.g., of cellular 
proteins, or of randomly generated polypeptides which bind to an H. pylori protein. 
(The H. pylori domain is used as the bait protein and the library of variants are expressed 
as fish fusion proteins.) In an analogous fashion, a two hybrid assay (as with the other 
screening methods described herein), can be used to find polypeptides which bind a H. 

25 pylori polypeptide. 

(R) Display Libraries 

In one approach to screening assays, the candidate peptides are displayed on the 
surface of a cell or viral particle, and the ability of particular cells or viral particles to 

30 bind an appropriate receptor protein via the displayed product is detected in a "panning 
assay". For example, the gene library can be cloned into the gene for a surface 
membrane protein of a bacterial cell, and the resulting fusion protein detected by 
panning (Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; 
and Goward et al. (1992) TIBS 1 8:136-140). In a similar fashion, a detectably labeled 

35 ligand can be used to score for potentially functional peptide homologs. Fluorescently 
labeled ligands, e.g., receptors, can be used to detect homologs which retain ligand- 
binding activity. The use of fluorescently labeled ligands, allows cells to be visually 
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inspected and separated under a fluorescence microscope, or, where the morphology of 
the cell permits, to be separated by a fluorescence-activated cell sorter. 

A gene library can be expressed as a fusion protein on the surface of a viral 
particle. For instance, in the filamentous phage system, foreign peptide sequences can 
5 be expressed on the surface of infectious phage, thereby conferring two significant 
benefits. First, since these phage can be applied to affinity matrices at concentrations 
well over 10^^ phage per milliliter, a large number of phage can be screened at one time. 
Second, since each infectious phage displays a gene product on its surface, if a particular 
phage is recovered from an affinity matrix in low yield, the phage can be amplified by 

10 another round of infection. The group of almost identical £. coli filamentous phages 
M13, fd., and ff are most often used in phage display libraries. Either of the phage glll 
or gVIII coat proteins can be used to generate fusion proteins without disrupting the 
ultimate packaging of the viral particle. Foreign epitopes can be expressed at the NH2- 
terminal end of pIII and phage bearing such epitopes recovered from a large excess of 

15 phage lacking this epitope (Ladner et al. PCT publication WO 90/02909; Garrard et al., 
PCT publication WO 92/09690; Marks et al. (1992) J. BioL Chem, 267:16007-16010; 
Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. {\99\) Nature 352:624-628; 
and Barbas et al. (1992) PNAS 89:4457-4461). 

A common approach uses the maltose receptor of E. coli (the outer membrane 

20 protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037), 
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce 
peptides fused into one of the extracellular loops of the protein. These peptides are 
available for binding to ligands, e.g., to antibodies, and can elicit an immune response 
when the cells are administered to animals. Other cell surface proteins, e.g., OmpA 

25 (Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 
37-45), and PAL (Fuchs et al. {\99\) Bio/Tech 9, 1369-1372), as well as large bacterial 
surface structures have served as vehicles for peptide display. Peptides can be fused to 
pilin, a protein which polymerizes to form the pilus-a conduit for interbacterial exchange 
of genetic information (Thiry et al. (1989) AppL Environ. Microbiol. 55, 984-993). 

30 Because of its role in interacting with other cells, the pilus provides a useful support for 
the presentation of peptides to the extracellular environment. Another large surface 
structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of 
peptides to the subunit protein flagellin offers a dense array of many peptide copies on 
the host cells (Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of 

35 other bacterial species have also served as peptide fusion partners. Examples include the 
Staphylococcus protein A and the outer membrane IgA protease of Neisseria (Hansson 
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et al. (1992) J. Bacterial. 174, 4239-4245 and Klauser et al. (1990) EMBOJ. 9, 1991 - 
1999). 

In the filamentous phage systems and the LamB system described above, the 
physical link between the peptide and its encoding DNA occurs by the containment of 
5 the DNA within a particle (cell or phage) that carries the peptide on its surface. 

Capturing the peptide captures the particle and the DNA within. An alternative scheme 
uses the DNA-binding protein Lad to form a link between peptide and DNA (Cull et al. 
(1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the Lad gene 
with an oligonucleotide cloning site at its 3'-end. Under the controlled induction by 
1 0 arabinose, a Lacl-peptide fusion protein is produced. This fusion retains the natural 
ability of Lad to bind to a short DNA sequence known as LacO operator (LacO). By 
installing two copies of LacO on the expression plasmid, the Lad-peptide fusion binds 
tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a 
single oligonucleotide sequence and each cell expresses only a single peptide sequence, 
1 5 the peptides become specifically and stably associated with the DNA sequence that 
directed its synthesis. The cells of the library are gently lysed and the peptide-DNA 
complexes are exposed to a matrix of immobilized receptor to recover the complexes 
containing active peptides. The associated plasmid DNA is then reintroduced into cells 
for amplification and DNA sequencing to determine the identity of the peptide ligands. 
20 As a demonstration of the practical utility of the method, a large random library of 
dodecapeptides was made and selected on a monoclonal antibody raised against the 
opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a 
consensus sequence corresponding to a six-residue portion of dynorphin B. (Cull et al. 
(1992) Proc. Natl. Acad. Sci. U.S.A. 89-1869) 
25 This scheme, sometimes referred to as peptides-on-plasmids, differs in two 

important ways from the phage display methods. First, the peptides are attached to the 
C-terminus of the fusion protein, resulting in the display of the library members as 
peptides having free carboxy termini. Both of the filamentous phage coat proteins, pIII 
and pVin, are anchored to the phage through their C-termini, and the guest peptides are 
30 placed into the outward-extending N-terminal domains. In some designs, the phage- 
displayed peptides are presented right at the amino terminus of the fusion protem. 
(Cwirla, et al. (1990) Proc. Natl. Acad. Sci. U.S.A. 87, 6378-6382) A second difference 
is the set of biological biases affecting the population of peptides actually present in the 
libraries. The Lad fusion molecules are confined to the cytoplasm of the host cells. 
35 The phage coat fusions are exposed briefly to the cytoplasm during translation but are 
rapidly secreted through the inner membrane into the periplasmic compartment, 
remaining anchored in the membrane by their C-terminal hydrophobic domains, with the 
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N-termini, containing the peptides, protruding into the periplasm while awaiting 
assembly into phage particles. The peptides in the Lad and phage libraries may differ 
significantly as a result of their exposure to different proteolytic activities. The phage 
coat proteins require transport across the inner membrane and signal peptidase 
5 processing as a prelude to incorporation into phage. Certain peptides exert a deleterious 
effect on these processes and are underrepresented in the libraries (Gallop et al, (1994) J. 
Med, Chem, 37(9): 1233-1251). These particular biases are not a factor in the Lad 
display system. 

The number of small peptides available in recombinant random libraries is 

10 enormous. Libraries of lO'^-lO^ independent clones are routinely prepared. Libraries as 
largeas 10 recombinants have been created, but this size approaches the practical 
limit for clone libraries. This limitation in library size occurs at the step of transforming 
the DNA containing randomized segments into the host bacterial cells. To circumvent 
this limitation, an in vitro system based on the display of nascent peptides in polysome 

1 5 complexes has recently been developed. This display library method has the potential of - 
producing libraries 3-6 orders of magnitude larger than the currently available 
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, 
expression of the peptides, and screening, is done in an entirely cell-free format. 
In one application of this method (Gallop et al. (1994) J. Med. Chem. 

20 37(9): 1233-1 251), a molecular DNA library encoding 10^2 decapeptides was 
constructed and the library expressed in an E. coli S30 in vitro coupled 
transcription/translation system. Conditions were chosen to stall the ribosomes on the 
mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes 
and yielding complexes containing nascent peptides still linked to their encoding RNA. 

25 The polysomes are sufficiently robust to be affinity purified on immobilized receptors in 
much the same way as the more conventional recombinant peptide display libraries are 
screened. RNA fi-om the bound complexes is recovered, converted to cDNA, and 
amplified by PGR to produce a template for the next round of synthesis and screening. 
The polysome display method can be coupled to the phage display system. Following 

30 several rounds of screening, cDNA from the enriched pool of polysomes was cloned into 
a phagemid vector. This vector serves as both a peptide expression vector, displaying 
peptides fused to the coat proteins, and as a DNA sequencing vector for peptide 
identification. By expressing the polysome-derived peptides on phage, one can either 
continue the affinity selection procedure in this format or assay the peptides on 

35 individual clones for binding activity in a phage ELISA, or for binding specificity in a 
completion phage ELISA (Barret, et al. (1992) Anal. Biochem 204,357-364). To 



wo 98/18323 



PCT/US97/19575 



-48- 

identify the sequences of the active peptides one sequences the DNA produced by the 
phagemid host. 

Secondary Screening of Polypeptides and Analogs 
5 The high through-put assays described above can be followed by secondary 

screens in order to identify further biological activities which will, e.g., allow one skilled 
in the art to differentiate agonists from antagonists. The type of a secondary screen used 
will depend on the desired activity that needs to be tested. For example, an assay can be 
developed in which the ability to inhibit an interaction between a protein of interest and 
10 its respective ligand can be used to identify antagonists from a group of peptide 
fragments isolated though one of the primary screens described above. 

Therefore, methods for generating fragments and analogs and testing them for 
activity are known in the art. Once the core sequence of interest is identified, it is 
routine for one skilled in the art to obtain analogs and fragments. 

15 

Peptide Mimetics of H. pylori Polypeptides 

The invention also provides for reduction of the protein binding domains of the 
subject K pylori polypeptides to generate mimetics, e.g. peptide or non-peptide agents. 
The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, 

20 e.g., in the case of an //. pylori polypeptide binding to a naturally occurring ligand. The 
critical residues of a subject H. pylori polypeptide which are involved in molecular 
recognition of a polypeptide can be determined and used to generate H. pylori-denvcd 
peptidomimetics which competitively or noncompetitively inhibit binding of the //. 
pylori polypeptide with an interacting polypeptide (see, for example, European patent 

25 applications EP-412,762A and EP-B31,080A). 

For example, scanning mutagenesis can be used to map the amino acid residues 
of a particular H, pylori polypeptide involved in binding an interacting polypeptide, 
peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated 
which mimic those residues in binding to an interacting polypeptide, and which 

30 therefore can inhibit binding of an H, pylori polypeptide to an interacting polypeptide 
and thereby interfere with the function of H, pylori polypeptide. For instance, non- 
hydrolyzable peptide analogs of such residues can be generated using benzodiazepine 
(e.g., see Freidinger et al, in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988), a^epine (e.g., see Huffman et al. in 

35 Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 

Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides: Chemistry 
and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto- 
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methylene pseudopeptides (Ewenson et al. (19^6) J Med Chem 29:295; and Ewenson et 
al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide 
Symposium) Pierce Chemical Co. Rockland, IL, 1985), p-tum dipeptide cores (Nagai et 
al. (1985) Tetrahedron Lett 26:647; and Sato et al, {\9%6) J Chem Soc Perkin Trans 
1 : 123 1), and P-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res 
Commun\26'A\9\ and Dann et al. (1986) Biochem Biophys Res Commun 134:71). 

VL Vaccine Formulations for K pylori Nucleic Acids and Polypeptides 

This invention also features vaccine compositions or formulations (used 
interchangeably herein) for protection against infection by K pylori or for treatment of 
H. pylori infection. As used herein, the term "treatment of H. pylori infection" refers to 
therapeutic treatment of an existing or established H. pylori infection. The terms 
"protection against H. pylori infection" or "prophylactic treatment" refer to the use of H. 
pylori vaccine formulation for reducing the risk of or preventing an infection in a subject 
at risk for K pylori infection. In one embodiment, the vaccine compositions contain one 
or more immunogenic components, such as a surface protein, from //. pylori, or portion 
thereof, and a pharmaceutically acceptable carrier. For example, in one embodiment, the 
vaccine formulations of the invention contain at least one or combination of H. pylori 
polypeptides or fragments thereof, from same or different H. pylori antigens. Nucleic 
acids and H. pylori polypeptides for use in the vaccine formulations of the invention 
include the nucleic acids and polypeptides set forth in the Sequence Listing, preferably 
those K pylori nucleic acids that encode surface proteins and surface proteins or 
fragments thereof. For example, a preferred nucleic acid and H, pylori polypeptide for 
use in a vaccine composition of the invention is selected from the group of nucleic acids 
which encode cell envelope proteins and K pylori cell envelope proteins as set forth in 
Table 1 . However, any nucleic acid encoding an immunogenic H. pylori protein and K 
pylori polypetide, or portion thereof, can be used in the present invention. These 
vaccines have therapeutic and/or prophylactic utilities. 

One aspect of the invention provides a vaccine composition for protection 
against infection by K pylori which contains at least one immunogenic fragment of an 
//. pylori protein and a pharmaceutically acceptable carrier. Preferred fragments include 
peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino 
acid residues in length, and more preferably about 12-16 amino acid residues in length. 

Immunogenic components of the invention can be obtained, for example, by 
screening polypeptides recombinantly produced from the corresponding fragment of the 
nucleic acid encoding the full-length K pylori protein. In addition, fragments can be 



wo 98/18323 



PCTAJS97/19575 



-50- 

chemically synthesized using techniques known in the art such as conventional 
Merrifield soHd phase f-Moc or t-Boc chemistry. 

In one embodiment, immunogenic components are identified by the ability of the 
peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for 
example, T cell proliferation or cytokine secretion are defined herein as comprising at 
least one T cell epitope. T cell epitopes are believed to be involved in initiation and 
perpetuation of the immune response to the protein allergen which is responsible for the 
clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at 
the level of the T helper cell by binding to an appropriate HLA molecule on the surface 
of an antigen presenting celL thereby stimulating the T cell subpopulation with the 
relevant T cell receptor for the epitope. These events lead to T cell proliferation, 
lymphokine secretion, local inflammatory reactions, recruitment of additional immune 
cells to the site of antigen/T cell interaction, and activation of the B cell cascade, leading 
to the production of antibodies. A T cell epitope is the basic element, or smallest unit of 
recognition by a T cell receptor, where the epitope comprises amino acids essential to 
receptor recognition (e.g., approximately 6 or 7 amino acid residues). Amino acid 
sequences which mimic those of the T cell epitopes are within the scope of this 
invention. 

In another embodiment, immunogenic components of the invention are identified 
through genomic vaccination. The basic protocol is b£ised on the idea that expression 
libraries consisting of all or parts of a pathogen genome, e.g., an //. pylori genome, can 
confer protection when used to genetically immunize a host. This expression library 
immunization (ELI) is analogous to expression cloning and involves reducing a genomic 
expression library of a pathogen, e.g., //. pylori^ into plasmids that can act as genetic 
vaccines. The plasmids can also be designed to encode generic adjuvants which can 
dramatically stimulate the humoral response. These genetic adjuvants can be introduced 
at remote sites and act as well extracelluraly as intracellularly. 

This is a new approach to vaccine production that has many of the advantages of 
live/attenuated pathogens but no risk of infection. An expression library of pathogen 
DNA is used to immunize a host thereby producing the effects of antigen presentation of 
a live vaccine without the risk. For example, in the present invention, random fragments 
from the H, pylori genome or from cosmid or plasmid clones, as well as PGR products 
from genes identified by genomic sequencing, can be used to immunize a host. The 
feasibility of this approach has been demonstrated with Mycoplasma pulmonis (Barry et 
aL, Nature 377 :632-635, 1995), where even partial expression libraries of Mycoplasma 
pulmonis, a natural pathogen in rodents, provided protection against challenge from the 
pathogen. 



wo 98/18323 



PCT/US97/19575 



-51 - 

ELI is a technique that allows for production of a non-infectious multipartite 
vaccine, even when little is known about pathogen's biology, because ELI uses the 
immune system to screen candidate genes. Once isolated, these genes can be used as 
genetic vaccines or for development of recombinant protein vaccines. Thus, ELI allows 
5 for production of vaccines in a systematic, largely mechanized fashion. 

Screening immunogenic components can be accomplished using one or more of 
several different assays. For example, in vitro, peptide T cell stimulatory activity is 
assayed by contacting a peptide knovm or suspected of being immunogenic with an 
antigen presenting cell which presents appropriate MHC molecules in a T cell culture, 
1 0 Presentation of an immunogenic H. pylori peptide in association with appropriate MHC 
molecules"to T"cel Is^irTconj unc t ionT wi thlhe necessary costimulation has the effect of 
transmitting a signal to the T cell that induces the production of increased levels of 
cytokines, particularly of interieukin-2 and interleukin-4. The culture supematant can be _ 
obtained and assayed for interleukin-2 or other known cytokines. For example, any one 

1 5 of several conventional assays for interleukin-2 can be employed, such as the assay ^ 
described in Proc. NatL Acad, Sci USA, 86: 1333 (1989) the pertinent portions of which - 
are incorporated herein by reference. A kit for an assay for the production of interferon 
is also available from Genzyme Corporation (Cambridge, MA). 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 

20 thymidine incorporation. The proliferation of T cells can be measured in vitro by 

determining the amount of ^H-labeled thymidine incorporated into the replicating DNA 
of cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell 
division can be quantified. 

Vaccine compositions or formulations of the invention containing one or more 

25 immunogenic components (e.g., H. pylori polypeptide or fragment thereof or nucleic 
acid encoding an K pylori polypeptide or fragment thereof) preferably include a 
pharmaceutically acceptable carrier. The term "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible v^th 

30 pharmaceutical administration. Suitable pharmaceutically acceptable carriers include, 
for example, one or more of water, saline, phosphate buffered saline, dextrose, glycerol, 
ethanol and the like, as well as combinations thereof. Pharmaceutically acceptable 
carriers may further comprise minor amounts of auxiliary substances such as wetting or 
emulsifying agents, preservatives or buffers, which enhance the shelf life or 

35 effectiveness of the H, pylori nucleic acid or polypeptide. For vaccine formulations of 
the invention containing K pylori polypeptides, the polypeptide is preferably 
coadministered with a suitable adjuvant and/or a delivery system described herein. 
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It will be apparent to those of skill in the art that the therapeutically effective 
amount of DNA or protein of this invention will depend, inter alia, upon the 
administration schedule, the unit dose of an K pylori nucleic acid or polypeptide 
administered, whether the protein or nucleic acid is administered in combination with 
5 other therapeutic agents, the immune status and health of the patient, and the therapeutic 
activity of the particular protein or nucleic acid. 

Vaccine formulations are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Methods for intramuscular 
immunization are described by Wolff et al. (1990) Science 247: 1465-1468 and by 

1 0 Sedegah et al. ( 1 994) Immunology 9\: 9866-9870. Other modes of administration 

include oral and pulmonary formulations, suppositories, and transdermal applications. 
Oral immunization is preferred over parenteral methods for inducing protection against 
infection by //. pylori, Czinn et, al. (1993) Vaccine \ \: 637-642. Oral formulations 
include such normally employed excipients as, for example, pharmaceutical grades of 

15 mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 
carbonate, and the like. 

In one embodiment, the vaccine formulation includes, as a pharmaceutically 
acceptable carrier, an adjuvant. Examples of the suitable adjuvants for use in the 
vaccine formulations of the invention include, but are not limited, to aluminum 

20 hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP); N-acetyl-nor- 
muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as nor-MDP); N- 
acetyImuramyl-L-alanyl-D-isoglutaminyl-L-aIanine-2-(r-2*-dipalmitoyl-sn.glycero-3- 

hydroxyphos-phoryloxy)-ethylamine (CGP 19835 A, referred to a MTP-PE); RIBI, 
which contains three components from bacteria; monophosphoryl lipid A; trehalose 

25 dimycoloate; cell wall skeleton (MPL + TDM + CWS) in a 2% squalene/Tween 80 
emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of 
cholera toxin, including its B subunit, and/or conjugates or genetically engineered 
fusions of the H. pylori polypeptide with cholera toxin or its B subunit, 
procholeragenoid, fungal polysaccharides, including schizophyllan, muramyl dipeptide, 

30 muramyl dipeptide derivatives, phorbol esters, labile toxin of coli, non-H, pylori 
bacterial lysates, block polymers or saponins. 

In another embodiment, the vaccine formulation includes, as a pharmaceutically 
acceptable carrier, a delivery system. Suitable delivery systems for use in the vaccine 
formulations of the invention include biodegradable microcapsules or immuno- 

35 stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered 

attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virusrlike 
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particles, e.g., bluetongue. In another embodiment of the invention, the vaccine 
fomiulation includes both a delivery system and an adjuvant. 

Delivery systems in humans may include enteric release capsules protecting the 
antigen from the acidic environment of the stomach, and including H. pylori polypeptide 
in an insoluble form as fusion proteins. Suitable carriers for the vaccines of the 
invention are enteric coated capsules and polylactide-glycolide microspheres. Suitable 
diluents are 0.2 N NaHC03 and/or saline. 

Vaccines of the invention can be administered as a primary prophylactic agent in 
adults or in children, as a secondary prevention, after successful eradication of K pylori 
in an infected host, or as a therapeutic agent in the aim to induce an immune response in 
a susceptible host to prevent infection by K pylori. The vaccines of the invention are 
administered in amounts readily determined by persons of ordinary skill in the art. 
Thus, for adults a suitable dosage will be in the range of 10 p.g to 10 g, preferably 10 |ag 
to 100 mg, for example 50 \ig to 50 mg. A suitable dosage for adults will also be in the 
range of 5 \x% to 500 mg. Similar dosage ranges will be applicable for children. 

The amount of adjuvant employed will depend on the type of adjuvant used. For 
example, when the mucosal adjuvant is cholera toxin, it is suitably used in an amount of 
5 jiig to 50 i-ig, for example 1 0 j^g to 35 |ig. When used in the form of microcapsules, the 
amount used will depend on the amount employed in the matrix of the microcapsule to 
achieve the desired dosage. The determination of this amount is within the skill of a 
person of ordinary skill in the art. 

Those skilled in the art will recognize that the optimal dose may be more or less 
depending upon the patient's body weight, disease, the route of administration, and other 
factors. Those skilled in the art will also recognize that appropriate dosage levels can be 
obtained based on results with known oral vaccines such as, for example, a vaccine 
beised on an E. coli lysate (6 mg dose daily up to total of 540 mg) and with an 
enterotoxigenic E. coli purified antigen (4 doses of 1 mg) (Schulman et al., J. UroL 
150:917-921 (1993)); Boedecker et al., American Gastroenterological Assoc, 999:A-222 
(1993)). The number of doses will depend upon the disease, the formulation, and 
efficacy data from clinical trials. Without intending any limitation as to the course of 
treatment, the treatment can be administered over 3 to 8 doses for a primary 
immunization schedule over 1 month (Boedeker, American Gastroenterological Assoc. 
888:A-222 (1993)). 

In a preferred embodiment, a vaccine composition of the invention can be based 
on a killed whole E. coli preparation with an immunogenic fragment of an //. pylori 
protein of the invention expressed on its surface or it can be based on an E, coli lysate, 
wherein the killed E. coli acts as a carrier or an adjuvant. 
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It will be apparent to those skilled in the art that some of the vaccine 
compositions of the invention are useful only for preventing K pylori infection, some 
are useful only for treating //. pylori infection, and some are useful for both preventing 
and treating //. pylori infection. In a preferred embodiment, the vaccine composition of 
5 the invention provides protection against H, pylori infection by stimulating humoral 
and/or cell-mediated immunity against H, pylori. It should be understood that 
amelioration of any of the symptoms of K pylori infection is a desirable clinical goal, 
including a lessening of the dosage of medication used to treat K pylori-canscd disease, 
or an increase in the production of antibodies in the serum or mucous of patients. 

10 

VII. Antibodies Reactive With H. pylori Polypeptides 

The invention also includes antibodies specifically reactive with the subject H. 
pylori polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be 
made by standard protocols (See, for example. Antibodies: A Laboratory Manual ed. by 

15 Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a 
hamster or rabbit can be immunized with an immunogenic form of the peptide. 
Techniques for conferring immunogenicity on a protein or peptide include conjugation 
to carriers or other techniques well known in the art. An immunogenic portion of the 
subject H. pylori polypeptide can be administered in the presence of adjuvant. The 

20 progress of immunization can be monitored by detection of antibody titers in plasma or 
serum. Standard ELISA or other immunoassays can be used with the immunogen as 
antigen to assess the levels of antibodies. 

In a prefenred embodiment, the subject antibodies are immunospecific for 
antigenic determinants of the K pylori polypeptides of the invention, e.g. antigenic 

25 determinants of a polypeptide of the invention contained in the Sequence Listing, or a 

closely related human or non-human mammalian homolog (e.g., 90% homologous, more 
preferably at least 95% homologous). In yet a further preferred embodiment of the 
invention, the anti-K pylori antibodies do not substantially cross react (i.e., react 
specifically) with a protein which is for example, less than 80% percent homologous to a 

30 sequence of the invention contained in the Sequence Listing. By "not substantially cross 
react", it is meant that the antibody has a binding affinity for a non-homologous protein 
which is less than 1 0 percent, more preferably less than 5 percent, and even more 
preferably less than 1 percent, of the binding affinity for a protein of the invention 
contained in the Sequence Listing. In a most preferred embodiment, there is no 

35 crossreactivity between bacterial and mammalian antigens. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented 



Wp 98/18323 



PCT/US97/19575 



-55- 

using conventional techniques and the fragments screened for utility in the same manner 
as described above for whole antibodies. For example, F(ab*)2 fragments can be 
generated by treating antibody with pepsin. The resulting F(ab')2 fragment can be 
treated to reduce disulfide bridges to produce Fab* fragments. The £Uitibody of the 
5 invention is further intended to include bispecific and chimeric molecules having an 
anti-//. pylori portion. 

Both monoclonal and polyclonal antibodies (Ab) directed against K pylori 
polypeptides or //. pylori polypeptide variants, and antibody fragments such as Fab' and 
F(ab')2, can be used to block the action of K pylori polypeptide and allow the study of 

10 the role of a particular H, pylori polypeptide of the invention in aberrant or unwanted 
intracellular signaling, as well as the normal cellular function of the H. pylori and by 
microinjection of anti-//. pylori polypeptide antibodies of the present invention. 

Antibodies which specifically bind K pylori epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 

15 pattem of expression of H. pylori antigens. Anti H. pylori polypeptide antibodies can be 
used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate 
H. pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, 
the ability to monitor H. pylori polypeptide levels in an individual can allow 
determination of the efficacy of a given treatment regimen for an individual afflicted 

20 with such a disorder. The level of an K pylori polypeptide can be measured in cells 
found in bodily fluid, such as in urine samples or can be measured in tissue, such as 
produced by gastric biopsy. Diagnostic assays using anti-// pylori antibodies can 
include, for example, immunoassays designed to aid in early diagnosis of// pylori 
infections. The present invention can also be used as a method of detecting antibodies 

25 contained in samples from individuals infected by this bacterium using specific // pylori 
antigens. 

Another application of anti-// pylori polypeptide antibodies of the invention is 
in the immunological screening of cDNA libraries constructed in expression vectors 
such as >.gtl 1, Xgt 18-23, A.ZAP, and A.0RF8. Messenger libraries of this type, having 

30 coding sequences inserted in the correct reading frame and orientation, can produce 

fusion proteins. For instance, A.gtl 1 will produce fusion proteins whose amino termini 
consist of Q-galactosidase amino acid sequences and whose carboxy termini consist of a 
foreign polypeptide. Antigenic epitopes of a subject // pylori polypeptide can then be 
detected with antibodies, as, for example, reacting nitrocellulose filters lifted from 

35 infected plates with anti-// pylori polypeptide antibodies. Phage, scored by this assay, 
can then be isolated from the infected plate. Thus, the presence of H. pylori gene 



wo 98/18323 



PCT/US97/19575 



- 56- 

homologs can be detected and cloned from other species, and alternate isoforms 
(including splicing variants) can be detected and cloned. 

VTII. Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention 
5 The nucleic acid, polypeptides and antibodies of the invention can be combined 

with other reagents and articles to form kits. Kits for diagnostic purposes typically 
comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels. 
Kits typically comprise other reagents for performing hybridization reactions, 
polymerase chain reactions (PGR), or for reconstitution of lyophilized components, such 

1 0 as aqueous media, salts, buffers, and the like. Kits may also comprise reagents for 
sample processing such as detergents, chaotropic salts and the like. Kits may also 
comprise immobilization means such as psirticles, supports, wells, dipsticks and the like. 
Kits may also comprise labeling means such as dyes, developing reagents, radioisotopes, 
fluorescent agents, luminescent or chemiluminescent agents, enzymes, intercalating 

15 agents and the like. With the nucleic acid and amino acid sequence information provided 
herein, individuals skilled in art can readily assemble kits to serve their particular 
purpose. Kits further can include instructions for use, 

IX. Drug Screening Assays Using H. pylori Polypeptides 

20 By making available purified and recombinant H. pylori polypeptides, the 

present invention provides assays which can be used to screen for drugs which are either 
agonists or antagonists of the normal cellular function, in this case, of the subject H. 
pylori polypeptides, or of their role in intracellular signaling. Such inhibitors or 
potentiators may be useful as new therapeutic agents to combat H, pylori infections in 

25 humans. A variety of assay formats will suffice and, in light of the present inventions, 
will be comprehended by the skilled artisan. 

In many drug screening programs which test libr^es of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 

30 systems, such as may be derived with purified or semi-purified proteins, are often 

preferred as "primary" screens in that they can be generated to permit rapid development 
and relatively easy detection of an alteration in a molecular target which is mediated by 
a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the 
test compound can be generally ignored in the in vitro system, the assay instead being 

35 focused primarily on the effect of the drug on the molecular target as may be manifest in 
an alteration of binding affinity with other proteins or change in enzymatic properties of 
the molecular target. Accordingly, in an exemplary screening assay of the present 
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invention, the compound of interest is contacted with an isolated and purified H. pylori 
polypeptide. 

Screening assays can be constructed in vitro with a purified H, pylori 
polypeptide or fragment thereof, such as an K pylori polypeptide having enzymatic 
activity, such that the activity of the polypeptide produces a detectable reaction product. 
The efficacy of the compound can be assessed by generating dose response curves from 
data obtained using various concentrations of the test compound. Moreover, a control 
assay can also be performed to provide a baseline for comparison. Suitable products 
include those with distinctive absorption, fluorescence, or chemi-luminescence 
properties, for example, because detection may be easily automated. A variety of 
synthetic or liafufally occurring compounds can be tested in the assay to identify those 
which inhibit or potentiate the activity of the H, pylori polypeptide. Some of these 
active compounds may directly, or with chemical alterations to promote membrane 
permeability or solubility, also inhibit or potentiate the same activity (e.g., enzymatic 
activity) in whole, live H. pylori cells. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references and published patent 
applications cited throughout this application are hereby incorporated by reference. 

EXEMPLIFICATION 

I. Cloning and Sequencing of H. pylori DNA 

H, pylori chromosomal DNA was isolated according to a basic DNA protocol 
outlined in Schleif R.F. and Wensink P.C., Practical Methods in Molecular Biology, 
p.98. Springer- Verlag, NY., 1981, with minor modifications. Briefly, cells were 
pelleted, resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6) and GES lysis buffer 
(5.1 M guanidium thiocyanate, 0.1 M EDTA, pH 8.0, 0.5% N-laurylsarcosine) was 
added. Suspension was chilled and ammonium acetate (NH4AC) was added to final 
concentration of 2.0 M. DNA was extracted, first with chloroform, then with phenol- 
chloroform, and reextracted with chloroform. DNA was precipitated with isopropanol, 
washed twice with 70% EtOH, dried and resuspended in TE. 

Following isolation whole genomic K pylori DNA was nebulized (Bodenteich et 
al.. Automated DNA Sequencing and Analysis (J.C. Venter, ed.). Academic Press, 1994) 
to a median size of 2000 bp. After nebulization, the DNA was concentrated and 
separated on a standard 1% agarose gel. Several fractions, corresponding to 
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approximate sizes 900-1300 bp, 1300-1700 bp, 1700-2200 bp, 2200-2700 bp, were 
excised from the gel and purified by the GeneClean procedure (Bio 101, Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. 
The healed DNA was then ligated to unique BstXI-linker adapters in 100-1000 fold 
molar excess. These linkers are complimentary to the BstXI-cut pMPX vectors, while 
the overhang is not self-complimentary. Therefore, the linkers will not concatemerize 
nor will the cut-vector religate itself easily. The linker-adopted inserts were separated 
from the unincorporated linkers on a 1% agarose gel and purified using GeneClean. The 
linker-adopted inserts were then ligated to each of the 20 pMPX vectors to construct a 
series of "shotgun" subclone libraries. The vectors contain an out-of-frame lacZ gene at 
the cloning site which becomes in-frame in the event that an adapter-dimer is cloned, 
allowing these to be avoided by their blue-color. 

All subsequent steps were based on the multiplex DNA sequencing protocols 
outhned in Church G.M. and Kieffer-Higgins S., Science 240:185-188, 1988. Only 
major modifications to the protocols are highlighted. Briefly, each of the 20 vectors was 
then transformed into DH5a competent cells (Gibco/BRL, DH5a transformation 
protocol). The libraries were assessed by plating onto antibiotic plates containing 
ampicillin, methicillin and IPTG/Xgal. The plates were incubated overnight at 370C. 
Successful transformants were then used for plating of clones and pooling into the 
multiplex pools. The clones were picked and pooled into 40 ml growth medium 
cultures. The cultures were grown overnight at 37^C. DNA was purified using the 
Qiagen Midi-prep kits and Tip- 100 columns (Qiagen, Inc.), In this manner, 100 \xg of 
DNA was obtained per pool. Fifteen 96-well plates of DNA were generated to obtain a 
5-10 fold sequence redundancy assuming 250-300 base average read-lengths. 

These purified DNA samples were then sequenced using the multiplex DNA 
sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins 
S„ Science 240:185-188, 1988) or by Sequithrem (Epicenter Technologies) dideoxy 
sequencing protocols. The sequencing reactions were electrophoresed and transferred 
onto nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P. 
and Church G.M., Methods in Enzymology 2U:IS7'222, 1993) or by electroblotting 
(Church, supra), 24 samples were run per gel. 45 successful membranes were produced 
by chemical sequencing and 8 were produced by dideoxy sequencing. The DNA was 
covalently bound to the membranes by exposure to ultraviolet light, and hybridized with 
labeled oligonucleotides complimentary to tag sequences on the vectors (Church, supra). 
The membranes were washed to rinse off non-specifically bound probe, and exposed to 
X-ray film to visualize individual sequence ladders. After autoradiography, the 
hybridized probe was removed by incubation at 65** C, and the hybridization cycle 
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repeated with another tag sequence until the membrane had been probed 38 times for 
chemical sequencing membranes and 10 times for the dideoxy sequencing membranes. 
Thus, each gel produced a large number of films, each containing new sequencing 
information. Whenever a new blot was processed, it was initially probed for an internal 
5 standard sequence added to each of the pools. 

Digital images of the films were generated using a laser-scanning densitometer 
(Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on 
computer workstations (VaxStation 4000*s) using the program REPLICA™ (Church et 
al.. Automated DNA Sequencing and Analysis (J.C. Venter, ed.). Academic Press, 1994). 

10 Image processing included lane straightening, contrast adjustment to smooth out 

interisityciifferences, and re^oliSion enhancement by iterative gaussian deconvolution. 
The sequences were then automatically picked in REPLICA™ and displayed for 
interactive proofreading before being stored in a project database. The proofreading was 
accomplished by a quick visual scan of the film image followed by mouse clicks on the 

1 5 bands of the displayed image to modify the base calls. Many of the sequence errors 
could be detected and corrected because multiple sequence reads covering the same 
portion of the genomic DNA provide adequate sequence redundancy for editing. Each 
sequence automatically received an identification number (corresponding to microtiter 
plate, probe information, and lane set number). This number serves as a permanent 

20 identifier of the sequence so it is always possible to identify the original of any 
particular sequence without recourse to a specialized database. 

Routine assembly of H. pylori sequences was done using the program FALCON 
(Church, Church et al.. Automated DNA Sequenicng and Analysis (J.C. Venter, ed.). 
Academic Press, 1994). This program has proven to be fast and reliable for most 

25 sequences. The assembled contigs were displayed using a modified version of 

GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al.. 
Nucleic Acid Res. 12:387-95, 1984) that interacts with REPLICA™. This provided for 
an integrated editor that allows multiple sequence gel images to be instantaneously 
called up fi-om the REPLICA™ database and displayed to allow rapid scanning of 

30 contigs and proofreading of gel traces where discrepEincies occurred between different 
sequence reads in the assembly. 



II. Identification, cloning and expression of recombinant H. pylori DNA sequences 

To facilitate the cloning, expression and purification of membrane and secreted 
35 proteins from H. pylori a powerful gene expression system, the pET System (Novagen), 
for cloning and expression of recombinant proteins in E, coli, was selected. Also, a 
DNA sequence encoding a peptide tag, the His-Tag, was fused to the 3* end of DNA 
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sequences of interest in order to facilitate purification of the recombinant protein 
products. The 3' end was selected for fusion in order to avoid alteration of any 5' 
terminal signal sequence. The exception to the above was ppiB, a gene cloned for use as 
a control in the expression studies. In this study, the sequence for K pylori ppiB 
5 contains a DNA sequence encoding a His-Tag fused to the 5' end of the full length gene, 
because the protein product of this gene does not contain a signal sequence and is 
expressed as a cytosolic protein. 



PCR Amplification and cloning of DNA sequences containing ORFsfor membrane and 

1 0 secreted proteins from the J99 Strain of Helicobacter pylori. 

Sequences chosen (from the list of the DNA sequences of the invention) for 
cloning from the J99 strain of K pylori were prepared for amplification cloning by 
polymerase chain reaction (PCR). Synthetic oligonucleotide primers (Table 3) specific 
for the 5' and 3' ends of open reading frames (ORFs) were designed and purchased 

15 (GibcoBRL Life Technologies, Gaithersburg, MD, USA). All forward primers (specific 
for the 5' end of the sequence) were designed to include an Ncol cloning site at the 
extreme 5' terminus, except for HpSeq. 4821082 where Ndel was used. These primers 
were designed to permit initiation of protein translation at a methionine residue followed 
by a valine residue and the coding sequence for the remainder of the native H. pylori 

20 DNA sequence. An exception is H, pylori sequence 4821082 where the initiator 
methionine is immediately followed by the remainder of the native H. pylori DNA 
sequence. All reverse primers (specific for the 3' end of any K pylori ORF) included a 
EcoRI site at the extreme 5' terminus to permit cloning of each K pylori sequence into 
the reading frame of the pET-28b. The pET-28b vector provides sequence encoding an 

25 additional 20 carboxy-terminal amino acids (only 19 amino acids in HpSeq. 26380318 
and HpSeq. 14640637) including six histidine residues (at the extreme C-temiinus), 
which comprise the His-Tag. An exception to the above, as noted earlier, is the vector 
construction for the ppiB gene. A synthetic oligonucleotide primer specific for the 5' 
end of ppiB gene encoded a BamHl site at its extreme 5' temiinus and the primer for the 

30 3' end of the ppiB gene encoded a Xhol site at its extreme 5' terminus. 
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TABLE 3 

Oligonucleotide primers used for PGR amplification of H. pylori DNA sequences 



Outer membrane 
Proteins 


Forward primer 5' to 3' 


Reverse Primer 5' to 3' 


Protein 16225006 


5'-TATACCATGGTGGG 
CGCTAA-3' (SEQ ID 
NO: 147) 


5'- 

ATGAATTCGAGTAAG 
GATTTTTG-3' (SEQ ID 
NO: 148) 


Protein 26054702 


5'- 

TTAACCATGGTGAAA 


5'- 

TAGAATTCGCATAAC 


AGCGATA-3' (SEQ ID 
NO: 149) 


GATCAATC-3* (SEQ ID 
NO: 150) 


Protein 71 16626 


5'- 

ATATCCATGGTGAGT 
TTGATGA-3' (SEQ ID 
NO:151) 


5'- 

ATGAATTCAATTTTT 

TArrnGCCA-3' (seq 

ID NO: 152) 


Protein 29479681 


5'- 

AATTCCATGGTGGGG 
GCTATG-3' (SEQ ID 
NO: 153) 


5'- 

ATGAATTCTCGATAG 
CCAAAATC-3' (SEQ ID 
NO: 154) 


Protein 14640637 


5'- 

AATTCCATGGTGCAT 
AACTTCCATT-3' (SEQ 
ID NO: 155) 


5'- 

AAGAATTCTCTAGCA 
TCCAAATGGA-3' (SEQ 
ID NO: 156) 


Periplasmic/ Secreted 
Proteins 






Protein 30100332 


5'-ATTTCCATGGTCATG 
TCTCATATT-3' (SEQ ID 
NO: 157) 


5'- 

ATGAATTCCATCTTT 
TATTCCAC-3' (SEQ ID 
NO:158) 


Protein 4721061 


5'-AACCATGGTGATTT 
TAAGCATTGAAAG-3' 
(SEQ ID NO: 159) 


5'- 

AAGAATTCCACTCA 
AAATTnTTAACAG-3' 
(SEQ ID NO: 160) 


Other Surface Proteins 






Protein 4821082 


5'-GATCATCCATATGTT 
ATCTTCTAAT-3' (SEQ 
ID NO:161) 


5'- 

TGAATTCAACCATTT 
TAACCCTG-3' (SEQ ID 
NO: 162) 
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Protein 978477 


5'-TATACCATGGTGAA 
ATTTTTTCTTTTA-3' 
(SEQ ID NO: 163) 


5'- 

AGAATTCAATTGCG 
TC TT GTAAAACj-J 
(SEQ ID NO: 164) 


Inner Membrane 
Protein 






Protein zoJsOila 


TATAPr'ATGGTnAT 
GGACAAACTC-3' (SEQ 
ID NO: 165) 


5'-ATGAATTCCCACTT 
GGGGCGATA-3* (SEQ 
ID NO: 166) 


Cytoplasmic Protein 






ppi 


5'-TTATGGATCCAAAC 
CAATTAAAACT-3' (SEQ 
ID NO: 167) 


5'-TATCTCGAGTTATA 
GAGAAGGGC-3* (SEQ 
ID NO: 168) 



Genomic DNA prepared from the J99 strain of H. pylori (ATCC #55679; 
deposited by Genome Therapeutics Corporation, 100 Beaver Street, Waltham, MA 
02154) was used as the source of template DNA for PCR amplification reactions 
5 (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et ai., 
eds., 1994). To amplify a DNA sequence containing an H. pylori ORE, genomic DNA 
(50 nanograms) was introduced into a reaction vial containing 2 mM MgCl2, 1 
micromolar synthetic oligonucleotide primers (forward and reverse primers) 
complementary to and flanking a defined H. pylori ORE, 0.2 mM of each 
1 0 deoxynucleotide triphosphate; dATP, dGTP. dCTP, dTTP and 2.5 units of heat stable 
DNA polymerase (Amplitaq, Roche Molecular Systems. Inc., Branchburg, NJ, USA) in 
a final volume of 100 microliters. The following thermal cycling conditions were used 
to obtain amplified DNA products for each ORE using a Perkin Elmer Cetus/ GeneAmp 
PCR System 9600 thermal cycler: 

15 

Protein 26054702, Protein 7116626, Protein 29479681, Protein 30100332, and 
Protein 4821082; 

Denaturation at 94^0 for 2 min, 

2 cycles at 94^0 for 15 sec, 30^0 for 15 sec and 72^0 for 1.5 min 
20 23 cycles at 940c for 15 sec, 550c for 15 sec and 720C for 1.5 min 
Reactions were concluded at 72°C for 6 minutes. 

Protein 16225006; 

Denaturation at 94^C for 2 min. 
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25 cycles at 95^C for 15 sec, 55^C for 15 sec and 72^C for 1.5 min 
Reaction was concluded at 72^C for 6 minutes. 

Protein 4721061; 

Denaturation at 94^C for 2 min, 

2 cycles at 94^C for 15 sec, 36^C for 15 sec and 72^C for 1.5 min 
23 cycles at 94^C for 15 sec, 60^0 for 15 sec and 72^C for 1.5 min 
Reactions were concluded at 72^C for 6 minutes. 

Protein 26380318; 

Denaturation at 94^CT6F2~rnini 

2 cycles at 94^C for 15 sec, 38<^C for 15 sec and 72^C for 1.5 min 
23 cycles at 94^0 for 15 sec, 62^C for 15 sec and 72^0 for 1 .5 min 
Reactions were concluded at 72^C for 6 minutes. 

Protein 14640637; 

Denaturation at 94^C for 2 min, 

2 cycles at 94^C for 15 sec, 33^C for 15 sec and 72^C for 1.5 min 
30 cycles at 94^C for 15 sec, 55^0 for 15 sec and 72^C for 1.5 min 
Reactions were concluded at 72^C for 6 minutes. 

Conditions for amplification of //I pylori ppiB; 
Denaturation at 94^C for 2 min, 

2 cycles at 94^C for 15 sec, 32^0 for 15 sec and 72^0 for 1.5 min 
25 cycles at 94^C for 15 sec, 56^0 for 15 sec and 72^0 for 1 .5 min 
Reactions were concluded at 72^C for 6 minutes 

Upon completion of thermal cycling reactions, each sample of amplified DNA 
was washed and purified using the Qiaquick Spin PGR purification kit (Qiagen, 
Gaithersburg, MD, USA). All amplified DNA samples were subjected to digestion with 
the restriction endonucleases, Ncol and EcoRI (New England BioLabs, Beverly, MA, 
USA), or in the case of HpSeq. 4821082 (SEQ ID NO: 1309), with Ndel and EcoRI 
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., 
eds., 1994). DNA samples were then subjected to electrophoresis on 1.0 % NuSeive 
(FMC BioProducts, Rockland, ME USA) agarose gels. DNA was visualized by 
exposure to ethidium bromide and long wave uv irradiation. DNA contained in slices 
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isolated from the agarose gel was purified using the Bio 101 GeneCleEUi Kit protocol 
(Bio 101 Vista, CA, USA). 

Cloning of H. pylori DNA sequences into the pET'28b prokaryotic expression vector. 

The pET-28b vector was prepared for cloning by digestion with Ncol and EcoRJ, 
or in the case of //. pylori protein 4821082 with Ndel and EcoRI (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F, Ausubel et aL, eds., 1994). In the case 
of cloning ppiB, the pET-28a vector, which encodes a His-Tag that can be fused to the 
5' end of an inserted gene, was used and the cloning site prepared for cloning with the 
ppiB gene by digestion with BamHI and Xhol restriction endonucleases. 

Following digestion, DNA inserts were cloned (Current Protocols in Moleculeir 
Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously 
digested pET-28b expression vector, except for the amplified insert for ppiB, which was 
cloned into the pET-28a expression vector. Products of the ligation reaction were then 
used to transform the BL21 strain of E, coli (Current Protocols in Molecular Biology, 
John Wiley and Sons, Inc., F. Ausubel et aL, eds., 1994) as described below. 

Transformation of competent bacteria with recombinant plasmids 

Competent bacteria, E coli strain BL21 or E. coli strain BL21 (DE3), were 
transformed with recombinant pET expression plasmids carrying the cloned H. pylori 
sequences according to standard methods (Current Protocols in Molecular, John Wiley 
and Sons, Inc., F. Ausubel et aL, eds., 1994). Briefly, 1 microliter of ligation reaction 
was mixed with 50 microliters of electrocompetent cells and subjected to a high voltage 
pulse, after which, samples were incubated in 0.45 milliliters SOC medium (0.5% yeast 
extract, 2.0 % tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgC12, 10 mM MgS04 and 
20, mM glucose) at 37^C with shaking for 1 hour. Samples were then spread on LB 
agar plates containing 25 microgram/ml kanamycin sulfate for growth overnight. 
Transformed colonies of BL21 were then picked and analyzed to evaluate cloned inserts 
as described below. 

Identification of recombinant pET expression plasmids carrying H. pylori sequences 

Individual BL21 clones transformed with recombinant pET-28b-H.pylori ORFs 
were analyzed by PCR amplification of the cloned inserts using the same forward and 
reverse primers, specific for each H. pylori sequence, that were used in the original PCR 
amplification cloning reactions. Successful amplification verified the integration of the 
H. pylori sequences in the expression vector (Current Protocols in Molecular Biology, 
John Wiley and Sons, Inc., F. Ausubel et aL, eds., 1994). 
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Isolation and Preparation of plasmid DNA from BL2I transformants 

Individual clones of recombinant pET-28b vectors carrying properly cloned H, 
pylori ORPs were picked and incubated in 5 mis of LB broth plus 25 microgram/ml 
kanamycin sulfate overnight. The following day plasmid DNA was isolated and 
purified using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, 
USA). 

Expression of recombinant H. pylori sequences in E, coli 

The pET vector can be propagated in any E, coli K-12 strain e.g. HMS174, 
HBlOl, JM109, DH5, etc. for the purpose of cloning or plasmid preparation. Hosts for 
expression include £. coli strains containing a chromosomal copy of the gene for T7 
RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative 
that carries the lacl gene, the lacUV5 promoter and the gene for T7 RNA polymerase. 
T7 RNA polymerase is induced by addition of isopropyl-B-D-thiogalactoside (IPTG), 
and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying a 
T7 promoter and a gene of interest. Strains used include: BL21(DE3) (Studier, F.W., 
Rosenberg, A.H., Dunn, J. J., and Dubendorfif, J.W. (1990) Meth. Enzymol. 185, 60-89). 

To express recombinant //. pylori sequences, 50 nanograms of plasmid DNA 
isolated as described above was used to transform competent BL21(DE3) bacteria as 
described above (provided by Novagen as part of the pET expression system kit). The 
lacZ gene (beta-galactosidase) was expressed in the pET-System as described for the K 
pylori recombinzint constructions. Transformed cells were cultured in SOC medium for 
1 hour, and the culture was then plated on LB plates containing 25 micrograms/ml 
kanamycin sulfate. The following day, bacterial colonies were pooled and grown in LB 
medium containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 
nM of 0.5 to 1.0 O.D. units, at which point, 1 millimolar IPTG was added to the culture 
for 3 hours to induce gene expression of the K pylori recombinant DNA constructions. 

After induction of gene expression with IPTG, bacteria were pelleted by 
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4^C. Pellets 
were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 
mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4^0. 
Wet pellets were weighed and frozen at -80^C until ready for protein purification. 

IIL Purification of recombinant proteins from E. coli 
Analytical Methods 

The concentrations of purified protein preparations were quantified 
spectrophotometrically using absorbance coefficients calculated from amino acid 
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content (Perkins, S.J, 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations were 
also measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, 
and Lowry, O.H., Rosebrough, N., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193, 
pages 265-275, using bovine serum albumin as a standard. 

SDS-polyacrylamide gels (12% or 4,0 to 25 % acrylamide gradient gels) were 
purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue. 
Molecular weight markers included rabbit skeletal muscle myosin (200 kDa), E. coli (- 
galactosidase (1 16 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum 
albumin (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (3 1 kDa), soybean 
trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 
kDa). 

1. Purification of soluble proteins 

All steps were carried out at 4^C. Frozen cells were thawed, resuspended in 5 
volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with 10% 
glycerol, 0. 1 % 2-mercaptoethanol, 200 ml lysozyme, 1 mM phenylmethylsulfonyl 
fluoride (PMSF), and 10 ug/ml each of leupeptin, aprotinin, pepstatin, L-l-chloro-3-[4- 
tosylamido]-7-amino-2-heptanone (TLCK), L- 1 -chloro-3-[4-tosylamido]-4-phenyl-2- 
butanone (TPCK), and soybean trypsin inhibitor, and ruptured by several passages 
through a small volume microfluidizer (Model M-1 lOS, Microfluidics International 
Corporation, Nev^on, MA). The resultant homogenate was made 0.1 % Brij 35, and 
centrifuged at 100,000 x g for 1 hour to yield a clear supernatant (crude extract). 

Following filtration through a 0,8 |am Supor filter (Gelman Sciences, FRG) the 
crude extract was loaded directly onto a Ni^"^- nitrilotriacetate-agarose (NTA) with a 5 
milliliter bed volume (Hochuli, E., Dbeli, H., and Schacheer, A. (1987) J. 
Chromatography 411, 177-184) pre-equilibrated in lysis buffer containing 10 % 
glycerol, 0.1 % Brij 35 and 1 mM PMSF. The column was washed with 250 ml (50 bed 
volumes) of lysis buffer containing 10 % glycerol, 0.1 % Brij 35, and was eluted with 
sequential steps of lysis buffer containing 10 % glycerol, 0.05 % Brij 35, 1 mM PMSF, 
and 20, 100, 200, and 500 mM imidazole in succession. Fractions were monitored by 
absorbance at OD28O ^» fractions were analyzed by SDS-PAGE. Fractions 

containing the recombinant protein eluted at 100 mM imidazole. 

Recombinant protein 14640637 and proteins, beta-galactosidase (lacZ) andpeptidyl- 
prolyl ciS'trans isomerase (ppiB) 

Fractions containing the recombinant proteins from the Ni^'*"-NTA-agarose 
columns were pooled and then concentrated to approximately 5 ml by centrifugal 
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filtration (Centriprep-lO, Amicon, MA), and loaded directly onto a 180-ml column (1.6 
X 91 cm) of Sephacryl S-100 HR gel filtration medium equilibrated in Buffer A (10 mM 
Hepes, pH 7.5, 150 mM NaCl, 0.1 mM EGTA) and run in Buffer A at 18 mi/h. 
Fractions containing the recombinant protein were identified by absorbance at 280 nm 
and analyzed by SDS-PAGE. Fractions were pooled and concentrated by centrifugal 
filtration. 

Recombinant protein 7116626 

Fractions containing the recombinant protein from the Ni^"*" -NTA-agarose 
column were pooled and dialyzed overnight against 1 liter of dialysis buffer (10 mM 
MOPS, pH 6.5, 50 mM NaCl, 0.1 mM EGTA, 0.02% Brij 35 and 1 mM PMSF). In the 
morning, a fine white precipitate was removed by centrifugation and the resulting 
supernatant was loaded onto an 8 ml (8 x 75 mm) MonoS high performance liquid 
chromatography column (Pharmacia Biotechnology, Inc.. Piscataway, NJ, USA) 
equilibrated in buffer B (10 mM MOPS, pH 6.5, 0.1 mM EGTA) containing 50 mM 
NaCl. The column was washed with 10 bed volumes of buffer B containing 50 mM 
NaCl, and developed with a 50-ml linear gradient of increasing NaCl (50 to 500 mM). 
Recombinant protein 71 16626 eluted as a sharp peak at 300 mM NaCl. 

2. Purification of insoluble proteins from inclusion bodies 

The following steps were carried out at 4^C. Cell pellets were resuspended in 
lysis buffer with 10% glycerol 200 fig/ ml lysozyme, 5 mM EDTA, ImM PMSF and 0.1 
% -mercaptoethanol. After passage through the cell disrupter, the resulting homogenate 
was made 0.2 % deoxycholate, stirred 10 minutes, then centrifuged at 20,000 x g, for 30 
min. The pellets were washed with lysis buffer containing 10 % glycerol, 10 mM 
EDTA, 1% Triton X-100, 1 mM PMSF and 0.1% -mercaptoethanol, followed by several 
washes with lysis buffer containing 1 M urea, 1 mM PMSF and 0.1 % 2- 
mercaptoethanol. The resulting white pellet was composed primarily of inclusion 
bodies, free of unbroken cells and membranous materials.. 

Recombinant proteins 2605470Z 16225006, 30100332. 4721061 

The following steps were carried out at room temperature. Purified inclusion 
bodies were dissolved in 20 ml 8.0 M urea in lysis buffer with 1 mM PMSF and 0.1 % 
2-mercaptoethanol, and incubated at room temperature for 1 hour. Materials that did not 
dissolve were removed by centrifugation. The clear supernatant was filtered, then 
loaded onto a Ni2+ -NTA agarose column pre-equilibrated in 8.0 M urea in Lysis 
Buffer. The column was washed with 250 ml (50 bed volumes) of lysis buffer 
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containing 8 M urea, 1 .0 mM PMSF and 0. 1 % 2-inercaptoethanol, and developed with 
sequential steps of lysis buffer containing 8M urea, 1 mM PMSF, 0.1 % 2- 
mercaptoethanol and 20, 100, 200, and 500 mM imidazole in succession. Fractions 
were monitored by absorbance at OD28O P^^^ fractions were analyzed by SDS- 

5 PAGE. Fractions containing the recombinant protein eluted at 100 mM imidazole. 

Recombinant proteins 29479681 26380318 

The pellet containing the inclusion bodies was solubilized in buffer B containing 
8 M urea, 1 mM PMSF and 0.1 % 2-mercaptoethanoK and incubated for 1 hour at room 

1 0 temperature. Insoluble materials were removed by centrifugation at 20,000 x g for 30 
min, and the cleared supernatant was loaded onto a 15 ml ( 1.6 x 7.5 cm ) SP-Sepharose 
column pre-equilibrated in buffer B, 6 M urea, 1 mM PMSF, 0.1 % 2-mercaptoethanoL 
After washing the column with 10 bed volumes, the column was developed with a linear 
gradient from 0 to 500 mM NaCl. 

1 5 Dialysis and concentration of protein samples 

Urea was removed slowly from the protein samples by dialysis against Tris- 
buffered saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCI) containing 0.5 % 
deoxycholate (DOC) with sequential reduction in urea concentration as follows; 6M, 
4M, 3M, 2M, IM, 0.5 M and finally TBS without any urea. Each dialysis step was 

20 conducted for a minimum of 4 hours at room temperature. 

After dialysis, samples were concentrated by pressure filtration using Amicon 
stirred-cells. Protein concentrations were measured using the methods of Perkins (1986 
Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and 
Lowry ((1951) J. Biol. Chem. 193, pages 265-275). 

25 The recombinant proteins purified by the methods described above are 

summarized in Table 4 below. 



TABLE 4 



J99 
Sequence 
Identifier 


Homolog 
identified 
by Blast 


Gene 
symbol 
of 

Homolog 


Bacterial ceil 
fraction used to 
purify 
recombinant 
proteins 


Method of 
purification 


Relative 
MWon 
SDS- 
PAGE gel 


Final 
concentratio 
n of purified 
protein 


Composit 
ionof 
buffer 


















Outer Memb 


»rane Proteins 




















16225006 


P28635 


YEAC 


Inclusion bodies 


His-Tag 


ISkDa 


5 mg/ml 


B 


















26054702 


PI 5929 


flgH 


inclusion bodies 


His-Tag 


37 kDa 


1.18 mg/mi 


B 
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— 


as dry 
pellet 


















7116626 


P26093 


e(P4) 


Soluble fraction 


His-Tag 


29 kDa 


0.8 mg/ml 


A 














1.85 mg/ml 


C 


















29479681 


P13036 


fecA 


Inclusions 
bodies 


sp. 

Seph arose 


23 kDa 


2.36 mg/ml 


B 














0.5 mg ml 


B 
















as dry 
pellet 


















14640637 


PI 6665 


TPFl 


Soluble fraction 


His-Tag 


1 7 kDa 


2.4 mg/ml 


A 










ge! filtration S 1 00 HR 



















Periplasm ic/Secreted Protein 



















3010032 


P23847 


dppA 


Inclusion bodies 


His-Tag 


1 1 kDa 


2.88 mg/mi 


B 


















4721061 


P36175 


GCP 


Inclusion bodies 


His-Tag 


38 kDa 


2.8 mg/ml 


B 



















Other Surface Proteins 



















4821082 


P08089 


M 
protein 


Inclusion bodies 


His-Tag 


20 kDa 


1.16 mg/ml 


B 


















978477 


L28919 


FBP54 


Inclusion bodies 


SP- 
Sepharose 


44 kDa 


2.56 mg/ml 


B 














0.3 mg/ml 


B 



Inner Membrane Proteins 



















26380318 


PI 5933 


niG 


Inclusion bodies 


SP- 
Sepharose 


1 1 kDa 


22 mg/mi 


B 



































Control Proteins with His-Tag 





















P00722 


lacZ 


Soluble fraction 


His-Tag 


116 kDa 


10 mg/ml 


A 










gel filtration S200 HR 
























ppiB 


Soluble fraction 


His-Tag 


21 kDa 


4.4 mg/ml 


A 










gel filtration SlOO HR 




Buffer 
composition 

s: 
















A-10 mM Hepes pH 7.5, 150 mM NaCI, 0.1 mM EGTA 










B= 10 mM Tris pH 8.0, 150 mM NaCl, 0.5 % DOC 










C= 10 mM MOPS pH 6.5, 300 mM NaCl, 0.1 EGTA 










II III 
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IV. Analysis of K pylori proteins as Vaccine candidates 

To investigate the immunomodulatory effect of K pylori proteins, a mouse/H 
pylori model was used. This model mimics the human H. pylori infection in many 
respects. The focus is on the effect of oral immunization in H, pylori infected animals 
5 in order to test the concept of therapeutic oral immunotherapy. 

Animals 

Female SPF BALB/c mice were purchased from Bomholt Breeding center 
(Denmark). They were kept in ordinary makrolon cages with free supply of water and 
10 food. The animals were 4-6 weeks old at arrival. 
Infection 

After a minimum of one week of acclimatization, the animals were infected with 
a type 2 strain (VacA negative) of K pylori (strain 244, originally isolated from an ulcer 
patient). In our hands, this strain has earlier proven to be a good colonizer of the mouse 
15 stomach. The bacteria were grown ovemight in Brucella broth supplemented with 10 % 
fetal calf serum, at iTC in a microaerophilic atmosphere (10% CO2, 5%02). The 
animals were given an oral dose of omeprazole (400 jimol/kg) and 3-5 h after this an 
oral inoculation of//, pylori in broth (approximately 10^ cfu/animal). Positive take of 
the infection was checked in some animals 2-3 weeks after the inoculation. 

20 

Antigens 

Recombinant K pylori antigens were chosen based on their association v^th 
externally exposed K pylori cell membrane. These antigens were selected from the 
following groups: (1.) Outer Membrane Proteins; (2.) Periplastic/Secreted proteins; (3.) 
25 Outer Surface proteins; and (4.) Inner Membrane proteins. All recombinant proteins 

were constructed with a hexa-HIS tag for purification reasons and the non-Helicobacter 
pylori control protein (b-galactosidase from E, coli; LacZ), was constructed in the same 
way. 

All antigens were given in a soluble form, i.e. dissolved in either a HEPES buffer 
30 or in a buffer containing 0.5% Deoxycholate (DOC). 

The antigens are listed in Table 5 below. 

Table 5 

Helicobacter pylori proteins 

35 

Outer membrane Proteins 

Protein 7116626 
Protein 4721061 
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Protein 16225006 
Protein 29479681 
Protein 14640637 

5 Periplasmic/Secreted Proteins 

Protein 30100332 

Other cell envelope proteins 

Protein 4821082 

10 

Flagella-associated proteins 

Protein 263803 18 

Control proteins 

1 5 b-galactosidase (LacZ) 

Immunizations 

Ten animals in each group were immunized 4 times over a 34 day period (day 1, 
15, 25 and 35). Purified antigens in solution or suspension were given at a dose of 100 

20 mg/mouse. As an adjuvant, the animals were also given 10 (ig/mouse of Cholera toxin 
(CT) with each immunization. Omeprazole (400 mmol/kg) was given orally to the 
animals 3-5 h prior to immunization as a way of protecting the antigens from acid 
degradation. Infected control animals received HEPES buffer + CT or DOC buffer + 
CT. Animals were sacrificed 2-4 weeks after final immunization. A general outline of 

25 the study is shown in Table 6 below. 



Table 6 

Study outline, therapeutic immunization: 

Mice were all infected with H. pylori strain Ah244 at day 30. 



Substance 


Mouse strain 
1^10 


Dose/mouse 


Dates for 
dosine 


1. Controls, PBS 


Balb/c 


0.3 ml 


0, 14, 24, 34 


2. Cholera toxin, 10 ng 


Balb/c 


0.3 ml 


0, 14, 24, 34 


3. Protein 16225006, 100 ng + CT 10 ngBalb/c 


0.3 ml 


0, 14,24,34 


4. Protein 26054702, 100 ^ig + CT 10 ngBalb/c 


0.3 ml 


0, 14, 24,34 
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5. Protein 26380318, 100 jig + CT 10 ngBalb/c 


0.3 ml 


0, 14, 24, 34 


6. Protein 29479681, 100 ng + CT 10 ^gBalb/c 


0.3 ml 


0. 14, 24, 34 


7. Protein 30100332, 100 |ig + CT 10 ngBalb/c 


0.3 ml 


0, 14, 24, 34 


8. Protein 4721061, 100 ^g + CT 10 ^g Balb/c 


0.3 ml 


0, 14,24, 34 


9. Protein 4821082, 100 ^g + CT 10 ng Balb/c 


0.3 ml 


0, 14, 24, 34 


10. Protein 71 16626, 100 |ig + CT 10 ng Balb/c 


0.3 ml 


0, 14, 24, 34 


1 1. Protein 14640637, 100 ^g + CT 10 ^g Balb/c 


0.3 ml 


0, 14, 24, 34 



1 5 Analysis of infection 

Mucosal infection: The mice were sacrificed by CO2 and cervical dislocation. 
The abdomen was opened and the stomach removed. After cutting the stomach along 
the greater curvature, it was rinsed in saline. The mucosa from the antrum and corpus of 
an area of 25mm^ was scraped separately with a surgical scalpel. The mucosa scraping 
20 was suspended in Brucella broth and plated onto Blood Skirrow selective plates. The 
plates were incubated under microaerophilic conditions for 3-5 days and the number of 
colonies was counted. The identity of H. pylori was ascertained by urease and catalase 
test and by direct microscopy or Gram staining. 

The urease test was performed essentially as follows. The reagent. Urea Agar 
25 Base Concentrate, was purchased from DIFCO Laboratories, Detroit, MI (Catalog # 
0284-6 1 -3). Urea agar base concentrate was diluted 1:10 with water. 1 ml of if the 
diluted concentrate was mixed with 100-200 ml of actively growing K pylori cells. 
Color change to magenta indicated that cells were urease positive. 

The catalase test was performed essentially as follows. The reagent, N,N,N',N*- 
30 Tetramethyl-p-Phenylenediamine, was purchased from Sigma, St. Louis, MO (Catalog # 
T3134), A solution of the regent (1% w/v in water) was prepared. K pylori cells were 
swabbed onto Whatman filter paper and overlaid with the 1% solution. Color change to 
dark blue indicated that the cells were catalase positive. 

Serum antibodies: From all mice serum was prepared from blood drawn by heart 
35 puncture. Serum antibodies were identified by regular ELISA techniques, where the 
specific antigens of Helicobacter pylori were plated. 

Mucosal antibodies: Gentle scrapings of a defined part of the corpus and of 4 cm 
of duodenum were perfomied in 50% of the mice in order to detect the presence of 
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antibodies in the mucous. The antibody titers were determined by regular ELISA 
technique as for serum antibodies. 

Statistical analysis: Wilcoxon-Mann- Whitney sign rank test was used for 
determination of significant effects of the antigens on Helicobacter pylori colonization. 
5 P<0.05 was considered significant. Because the antrum is the major colonization site for 
Helicobacter most emphasis was put upon changes in the antral colonization. 

Results 

Antibodies in sera : All antigens tested given together with CT gave rise to a 
10 measurable specific titer in serum. The highest responses were seen with Protein 

7116626, Protein 4721061, Protein 26380318, Protein 14640637 and Protein 4821082 
(see Figure 1). 

Antibodies in mucus : In the mucus scrapings, specific antibodies against all 
antigens tested were seen. By far the strongest response was seen with Protein 
15 30100332, followed by Protein 14640637, and Protein 263803 1 8 (see Figure 2). 



Therapeutic immunization effects: 

All control animals (B ALB/c mice) were well colonized with H pylori (strain 
AH244) in both antrum and corpus of the stomach. Of the antigens tested 3 proteins 

20 (Protein 4721061, Protein 4821082, and Protein 14640637) gave a good and significant 
reduction and/or eradication of the H. pylori infection. The degree of colonization of the 
antrum was lower following immunization with Protein 71 16626 and Protein 26380318 
compared to control. The effect of Proteins 16225006, 29479681, and 30100332 did not 
differ from control- The control protein lacZ, i.e. the non-H. pylori protein, had no 

25 eradication effect and in fact had higher Helicobacter colonization compared to the 
HEPES + CT control. AH data are shown in Figures 3 and 4 for proteins dissolved in 
HEPES and DOC respectively. Data is shown as geometric mean values. n=8-10 
Wilcoxon-Mann- Whitney sign rank test * = p<0.05; x/10 = number of mice showing 
eradication of H pylori over the total number of mice examined. 

30 

The data presented indicate that all of the H pylori associated proteins included 
in this study, when used as oral immunogens in conjunction with the oral adjuvant CT, 
resulted in stimulation of an immune response as measured by specific serum and 
mucosal antibodies. A majority of the proteins led to a reduction, and in some cases 
35 complete clearance of the colonization of H pylori in this animal model. It should be 
noted that the reduction or clearance was due to heterologous protection rather than 
homologous protection (the polypeptides were based on the K pylori J99 strain 



wo 98/18323 



PCT/US97/19575 



10 



-74- 

sequence and used in the therapeutic immunization studies against a different (AH244) 
challenge strain, indicating the vaccine potential against a wide variety ofK pylori 
strains. 

The highest colonization in the antrum was seen in animals treated with the non- 
Helicobacter protein LacZ, indicating that the effects seen with the Helicobacter pylori 
antigens were specific. 

Taken together these data strongly support the use of these H. pylori proteins in a 
pharmaceutical formulation for the use in humans to treat and/or prevent H pylori 
infections. 



V. Sequence Variance Analysis of genes in Helicobacter pylori strains 

Four genes were cloned and sequenced from several strains ofH pylori to 
compare the DNA and deduced amino acid sequences. This information was used to 
determine the sequence variation between the H. pylori strain, J99, and other H pylori 
15 strains isolated from human patients. 



Preparation of Chromosomal DNA. 

Cultures of H pylori strains (as listed in Table 9) were grown in BLBB (1% 
Tryptone, 1% Peptamin 0.1% Glucose, 0.2% Yeast Extract 0.5% Sodium Chloride, 5% 

20 Fetal Bovine Serum) to an ODgoo of 0.2. Cells were centrifuged in a Sorvall RC-3B at 
3500 X g at 4°C for 1 5 minutes and the pellet resuspended in 0.95 mis of 10 mM Tris- 
HCl, 0.1 mM EDTA (TE). Lysozyme was added to a final concentration of Img/ml 
along with, SDS to 1% and RNAse A + Tl to 0.5mg/ml and 5 units/ml respectively, and 
incubated at 3TC for one hour. Proteinase K was then added to a final concentration of 

25 0.4mg/ml and the sample was incubated at 55 C for more than one hour. NaCl was 

added to the sample to a concentration of 0.65 M, mixed carefully, and 0.15 ml of 10% 
CTAB in 0.7M NaCL (final is 1% CTAB/70mM NaCL) was added followed by 
incubation at 65*^C for 20 minutes. At this point, the samples were extracted with 
chloroform: isoamyl alcohol, extracted with phenol, and extracted again with 

30 chloroformrisoamyl alcohol. DNA was precipitated with either EtOH (1.5 x volumes) or 
isopropanol (0.6 x volumes) at -70°C for lOminutes, washed in 70% EtOH and 
resuspended in TE. 

PCR Amplification and cloning, 
35 Genomic DNA prepared from twelve strains of Helicobacter pylori was used as 

the source of template DNA for PCR amplification reactions (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To 
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amplify a DNA sequence containing an K pylori ORF, genomic DNA (10 nanograms) 
was introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic 
oligonucleotide primers (forward and reverse primers, see Table 7) complementary to 
and flanking a defined H, pylori ORF, 0.2 mM of each deoxynucleotide triphosphate; 
5 dATP, dGTP, dCTP, dTTP and 0.5 units of heat stable DNA.polymerase (Amplitaq, 
Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 20 
microliters in duplicate reactions. 

Table 7 

10 Oligonucleotide primers used for PGR amplification of H. pylori DNA sequences. 



Outer membrane 


Forward primer 5' to 3' 


Reverse Primer 5' to 3' 


Protein 26054702 (for 
strains AH4, AH 15, 
AH6 1,5294, 5640, 
AH18, and AH244) 


5'- 

TTAACCATGGTGAAA 
AGCGATA-3' (SEQ ID 
NO: 169) 


5'- 

TAGAATTCGCCTCTA 
AAACTTTAG-3' (SEQ 
ID NO: 170) 


Protein 26054702 

(for strains AH5, 5155, 
7958, AH24,and J99) 


5'- 

TTAACCATGGTGAAA 
AGCGATA-3' (SEQ ID 
NO:171) 


5'- 

TAGAATTCGCATAAC 
GATCAATC-3' (SEQ ID 
NO: 172) 


Protein 7116626 


5'- 

ATATCCATGGTGAGT 
TTGATGA-3' (SEQ ID 
NO: 173) 


5'- 

ATGAATTCAATTTTT 
TATTTTGCCA-3' (SEQ 
ID NO: 174) 


Protein 29479681 


5'- 

AATTCCATGGCTATC 
CAAATCCG-3' (SEQ ID 
NO: 175) 


5'- 

ATGAATTCGCCAAAA 
TCGTAGTATT-3' (SEQ 
ID NO: 176) 


1 Protein 346 


5'- 

GATACCATGGAATTT 
ATGAAAAAG-3' (SEQ 
ID NO: 177) 


5'- 

TGAATTCGAAAAAGT 
GTAGTTATAC-3' (SEQ 
ID NO: 178) 



The following thermal cycling conditions were used to obtain amplified DNA 
products for each ORF using a Perkin Elmer Cetus/ GeneAmp PGR System 9600 
1 5 thermal cycler: 
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Protein 7 1 1 6626 and Protein 346; 
Denaturation at 94^C for 2 min, 

2 cycles at 94''C for 15 sec, 30°C for 15 sec and 72''C for 1.5 min 
5 23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72''C for 1.5 min 
Reactions were concluded at 72°C for 6 minutes. 

Protein 26054702 for strains AH5, 5155, 7958, AH24,and J99; 
Denaturation at 94''C for 2 min, 
10 2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min 
25 cycles at 94°C for 15 sec, 55*'C for 15 sec and 72''C for 1.5 min 
Reaction was concluded at 72^C for 6 minutes. 

Protein 26054702 and Protein 294796813 for strains AH4, AH 15, AH61, 5294, 5640, 
15 AH18,andHp244 ; 

Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 15 sec, 30''C for 20 sec and 72°C for 2 min 
25 cycles at 94°C for 15 sec, 55''C for 20 sec and 72°C for 2 min 
Reactions were concluded at 72"C for 8 minutes. 

20 

Upon completion of thermal cycling reactions, each pair of samples were 
combined and used directly for cloning into the pCR cloning vector as described below. 

Cloning ofH. pylori DNA sequences into the pCR TA cloning vector 
25 All amplified inserts were cloned into the pCR 2,1 vector by the method 

described in the Original TA cloning kit (Invitrogen, San Diego, CA). Products of the 
ligation reaction were then used to transform the TOPI OF' (INVaF' in the case ofK 
pylori sequence 350) strain of E, coli as described below. 

30 Transformation of competent bacteria with recombinant plasmids 

Competent bacteria, E coli strain TOPI OF' or E. coli strain INVaF' were 
transformed with recombinant pCR expression plasmids carrying the cloned K pylori 
sequences according to standard methods (Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of 0.5 

35 micromolar BME was added to each vial of 50 microliters of competent cells. 

Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and 
incubated on ice for 30 minutes. The ceils and ligation mixture were then subjected to a 
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"heat shock" at 42*^0 for 30 seconds, and were subsequently placed on ice for an 
additional 2 minutes, after which, samples were incubated in 0.45 milliliters SOC 
medium (0.5% yeast extract, 2.0 % tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM 
MgC12, 10 mM MgS04 and 20, mM glucose) at 37°C with shaking for 1 hour. Samples 
5 were then spread on LB agar plates containing 25 microgram/ml kanamycin sulfate or 
100 micrograms/ml ampicillan for growth overnight. Transformed colonies of TOPI OF' 
or INVaF' were then picked and analyzed to evaluate cloned inserts as described below. 
Identification of recombinant PCR plasmids carrying H. pylori sequences 

Individual TOPI OF' or INVaF' clones transformed with recombinant pCR- 

1 0 H.pylori ORFs were analyzed by PCR amplification of the cloned inserts using the same 
forward and reverse primers, specificTor eacS7/. pylori sequence, that were usedliTtfie^ 
original PGR amplification cloning reactions. Successful amplification verified the 
integration of the K pylori sequences in the cloning vector (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et ai., editors, 1994). 

15 Individual clones of recombinant pCR vectors carrying properly cloned H. pylori 

ORFs were picked for sequence analysis. Sequence analysis was performed on ABI 
Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as 
found in PCRII or pCR2.1, Invitrogen, San Diego, CA) and sequencing primers specific 
to the ORF as listed in Table 8 below. 



20 
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Table 8 

Oligonucleotide primers used for sequencing of H pylori DNA sequences. 



Outer membrane 
Proteins 


Forward primers 5' to 3' 


Reverse Primers 5' to 3' 


Protein 26054702 


5'- 

CCCTTCATTTTAGAAATC 
G-3' (SEQIDNO:179) 
5'- 

ATTTCAACCAATTCAAT 
GCG-3' (SEQ ID NO: 180) 
5'- 

GCCCCi 1 IIGATTTGAAG 
CT-3'(SEQID NO: 181) 
5'- 

TCGCTCCAAGATACCAA 
GAAGT-3' (SEQ ID 
NO: 182) 
-) - 

CTTGAATTAGGGGCAAA 
GATCG-3' (SEQ ID 
NO: 183) 
5'- 

ATGCGTTTTTACCCAAA 
GAAGT-3* (SEQ ID 
NO- 1 84^ 
5'- 

ATAACGCCACTTCCTTAT 
TGGT-3' (SEQ ID NO: 1 85) 


5*- 

CTTTGGGTAAAAACGCA 
TC-3' (SEQ ID NO: 186) 
5'- 

CGATCTTTGATCCTAATT 
CA-3'(SEQ ID NO: 187) 
5'- 

ATCAAGTTGCCTATGCT 
GA-3' (SEQ ID NO: 188) 








Protein 7116626 


5'- 

TTGAACACTTTTGATTAT 
GCGG-3' (SEQ ID NO: 189) 
5'- 

GGATTATGCGATTG i rn 
ACAAG-3' (SEQ ID 
NO: 190) 


5'- 

GTCTTTAGCAAAAATGG 
CGTC-3' (SEQ ID NO: 191) 
5'- 

AATGAGCGTAAGAGAGC 
CTTC-3* (SEQ ID NO: 1 92) 


Protein 
29479681 


5"- 

CTTATGGGGGTATTGTC 
A-3' (SEQ ID NO: 193) 
5'- 

AGCATGTGGGTATCCAG 
C-3' (SEQ ID NO: 194) 


5'- 

AGGTTGTTGCCTAAAGA 
CT-3* (SEQ ID NO: 195) 
5'- 

CTGCCTCCACCTTTGATC 
-3* (SEQ ID NO: 196) 
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Protein 346 


5'- 

ACCAATATCAATTGGCA 
CT-3' (SEQ ID NO: 197) 
5'- 

ACTTGGAAAAGCTCTGC 
A-3' (SEQ ID NO: 198) 


5'- 

CTTGCTTGTCATATCTAG 
C-3' (SEQ ID NO: 199) 
5'- 

GTTGAAGTGTTGGTGCT 
A-3* (SEQ ID NO:200) 




5'- 

CAAGCAAGTGGTTTGGT 
TTTAG-3' (SEQ ID NO:201) 
5'- 

TGGAAAGAGCAAATCAT 
TGAAG-3' (SEQ ID 
NO:202) 


5'- 

GCCCATAATCAAAAAGC 
CCAT-3' (SEQ ID NO:203) 
5'- 

CTAAAACCAAACCACTT 
OCT 

TGTC-3' (SEQ ID NO:204) 








Vector Primers 


5'- 

GTAAAACGACGGCCAG- 
3' (SEQ ID NO:205) 


5'- 

CAGGAAACAGCTATGAC 
-3'{SEOIDNO:206) 



Results 

To establish the PGR error rate in these experiments, five individual clones of 
Protein 26054702, prepared from five sepeirate PGR reaction mixtures from H, pylori 
5 strain J99, were sequenced over a total length of 897 nucleotides for a cumulative total 
of 4485 bases of DNA sequence. DNA sequence for the five clones was compared to a 
DNA sequence obtained previously by a different method, i.e., random shotgun cloning 
and sequencing. The PGR error rate for the experiments described herein was 
determined to be 2 base changes out of 4485 bases, which is equivalent to an estimated 

1 0 error rate of less than or equal to 0.04%. 

DNA sequence analysis was performed on four different open reading frames 
identified as genes and amplified by PGR methods from a dozen different strains of the 
bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four 
open reading frames that were selected for this study showed statistically significant 

1 5 BLAST homology to defined proteins present in other bacterial species. Those ORFs 
included: Protein 26054702, homologous to the val A & B genes encoding an ABG 
transporter in F. novicida; Protein 71 16626, homologous to lipoprotein e (P4) present in 
the outer membrane of H. influenzae; Protein 29479681, homologous to fecA, an outer 
membrane receptor in iron (III) dicitrate transport in E. coli. Protein 346 was identified 

20 as an unknown open reading frame, because it showed low homology with sequences in 
the public databases. 

To assess the extent of conservation or variance in the ORFs across various 
strains of H, pylori, changes in DNA sequence and the deduced protein sequence were 
compared to the DNA and deduced protein sequences found in the J99 strain of H, 
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pylori (see Table 9 below). Results are presented as percent identity to the J99 strain of 
K pylori sequenced by random shotgun cloning. To control for any variations in the 
J99 sequence each of the four open reading frames were cloned and sequenced again 
from the J99 bacterial strain and that sequence information was compared to the 
5 sequence information that had been collected from inserts cloned by random shotgun 
sequencing of the J99 strain. The data demonstrate that there is variation in the DNA 
sequence ranging from as little as 0.12 % difference (Protein 346, J99 strain) to 
approximately 7% change (Protein 26054702, strain AH5). The deduced protein 
sequences show either no variation ( Protein 346, strains AH 1 8 and AH24) or up to as 
10 much as 7.66% amino acid changes (Protein 26054702, Strain AH5). 

Table 9 

Multiple Strain DNA Sequence analysis of H. pylori Vaccine Candidates 

15 

J99 Protein #: 26054702 2054702 7116626 7116626 29479681 29479681 346 346 
Length of Region 

Sequenced: 248 a.a. 746 nt. 232 a,a. 96 nt. 1 82 a.a. 548 nt 273 a.a. 819 nt. 

20 

Strain Tested 





AA 

identity 


Nuc. 
identity 


AA 

identity 


Nuc. 
identity 


AA 

identity 


Nuc. 
identity 


AA 

identity 


Nuc. 
identity 


J99 


100.00% 


100.00% 


100.00% 


100.00% 


100.00% 


100.00% 


99.63% 


99.88% 


AH244 


95.16% 


95.04% 


n.d. 


n.d. 


99.09% 


96.71% 


98.90% 


96.45% 


AH4 


95.97% 


95.98% 


97.84% 


95.83% 


n.d. 


n.d. 


97.80% 


95.73% 


AH5 


92.34% 


93.03% 


98.28% 


96.12% 


98.91% 


96.90% 


98.53% 


95.73% 


AH15 


95.16% 


94.91% 


97.41% 


95.98% 


99.82% 


97.99% 


99.63% 


96,09% 


AH61 


n.d. 


n.d. 


97.84% 


95.98% 


99.27% 


97.44% 


n.d. 


n.d. 


5155 


n.d. 


n.d. 


n.d. 


n.d. 


99.45% 


97.08% 


98.53% 


95.60% 


5294 


94.35% 


94.37% 


98.28% 


95.40% 


99.64% 


97.26% 


97.07% 


95.48% 


7958 


94.35% 


94.10% 


97.84% 


95.40% 


n.d. 


n.d. 


99.63% 


96.46% 


5640 


95.16% 


94.37% 


97.41% 


95.69% 


99.09% 


97.63% 


98.53% 


95.48% 


AH18 


n.d. 


n.d. 


98.71% 


95.69% 


99.64% 


97.44% 


100.00% 


95.97% 


AH24 


94.75% 


95.04% 


97.84% 


95.40% 


99.27% 


96.71% 


100.00% 


96.46% 



n.d.= not done. 
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VI. Experimental Knock-Qut Protocol for the Determination of Essential K pylori 
Genes as Potential Therapeutic Targets 

Therapeutic targets are chosen from genes whose protein products appear to play 
5 key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis, 
transcription, translation, regulation and coloni2:ation/virulence. 

The protocol for the deletion of portions of K pylori genes/ORFs and the 
insertional mutagenesis of a kanamycin-resistance cassette in order to identify genes 
which Eire essential to the cell is modified from previously published methods (Labigne- 
10 Roussel et al., 1988, J. Bacteriology 170, pp. 1704-1708; Cover et aL,1994, J. Biological 
Chemistry 269, pp: 10566- r0573r 1995, Proc. Natl. AcW. Sci:"92, pp" 

8768-8772). The result is a gene "knock-out." 

Identification and Cloning of H. pylori Gene Sequences 

15 The sequences of the genes or ORFs (open reading frames) selected as knock-out 

targets are identified from the K pylori genomic sequence and used to design primers to 
specifically amplify the genes/ORFs. All synthetic oligonucleotide primers are designed 
with the aid of the OLIGO program (National Biosciences, Inc., Plymouth, MN 55447, 
USA), and can be purchased from Gibco/BRL Life Technologies (Gaithersburg, MD, 

20 USA). If the ORF is smaller than 800 to 1000 base pairs, flanking primers are chosen 
outside of the open reading frame. 

Genomic DNA prepared from the Helicobacter pylori HpJ99 strain (ATCC 
55679; deposited by Genome Therapeutics Corporation, 100 Beaver Street, Waltham, 
MA 02154) is used as the source of template DNA for amplification of the ORFs by 

25 PCR (polymerase chain reaction) (Current Protocols in Molecular Biology, John Wiley 
and Sons, Inc., F. Ausubel et al., editors, 1994). For the preparation of genomic DNA 
from K pyloriy see Example I. PCR amplification is carried out by introducing 10 
nanograms of genomic HpJ99 DNA into a reaction vial containing 10 mM Tris pH 8.3, 
50 mM KCl, 2 mM MgCl2, 2 microMolar synthetic oligonucleotide primers 

30 (forward=Fl and reverse=Rl), 0.2 mM of each deoxynucleotide triphosphate 

(dATP,dGTP, dCTP, dTTP), and 1,25 units of heat stable DNA polymerase (Amplitaq, 
Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 40 
microliters. The PCR is carried out with Perkin Elmer Cetus/GeneAmp PCR System 
9600 thermal cyclers. 

35 Upon completion of thermal cycling reactions, each sample of amplified DNA is 

visualized on a 2% TAE agarose gel stained with Ethidium Bromide (Current Protocols 
in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) to 
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determine that a single product of the expected size had resulted from the reaction. 
Amplified DNA is then washed and purified using the Qiaquick Spin PGR purification 
kit (Qiagen, Gaithersburg, MD, USA). 

PGR products are cloned into the pT7Blue T-Vector (catalog#69820-l , Novagen, 
5 Inc., Madison, WL USA) using the TA cloning strategy (Gurrent Protocols in Molecular 
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The ligation of die 
PGR product into the vector is accomplished by mixing a 6 fold molar excess of the 
PGR product, 10 ng of pT7Blue-T vector (Novagen), 1 microliter of T4 DNA Ligase 
Buffer (New England Biolabs, Beverly, MA, USA), and 200 units of T4 DNA Ligase 

10 (New England Biolabs) into a final reaction volume of 10 microliters. Ligation is 
allowed to proceed for 16 hours at 16°G. 

Ligation products are electroporated (Gurrent Protocols in Molecular Biology, 
John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into electroporation- 
competent XL-1 Blue or DH5-a Kcoli cells (Glontech Lab., Inc. Palo Alto, GA, USA). 

15 Briefly, 1 microliter of ligation reaction is mixed with 40 microliters of 

electrocompetent cells and subjected to a high voltage pulse (25 microFarads, 2.5 kV, 
200 ohms) after which the samples are incubated in 0.45 ml SOG medium (0.5% yeast 
extract, 2% tryptone, 10 mM NaGl, 2.5 mM KGl, 10 mM MgGl2, 10 mM MgS04 and 20 
mM glucose) at 1>TC with shaking for 1 hour. Samples are then spread onto LB (10 g/1 

20 bacto tryptone, 5 g/1 bacto yeast extract, 10 g/1 sodium chloride) plates containing 100 
microgram/ml of Ampicillin, 0.3% X-gal, and 100 microgram/ml IPTG. These plates 
are incubated overnight at 37^G. Ampicillin-resistant colonies with white color are 
selected, grown in 5 ml of liquid LB containing 100 microgram/ml of Ampicillin, and 
plasmid DNA is isolated using the Qiagen miniprep protocol (Qiagen, Gaithersburg, 

25 MD, USA). 

To verify that the correct H.pylori DNA inserts had been cloned, these pTTBlue 
plasmid DNAs are used as templates for PGR amplification of the cloned inserts, using 
the same forward and reverse primers used for the initial amplification of the J99 
Kpylori sequence. Recognition of the primers and a PGR product of the correct size as 
30 visualized on a 2% TAB, ethidium bromide stained agarose gel are confirmation that the 
correct inserts had been cloned. Two to six such verified clones are obtained for each 
knock-out target, and frozen at -70*^G for storage. To minimize errors due to PGR, 
plasmid DNA from these verified clones are pooled, and used in subsequent cloning 
steps. 

35 The sequences of the genes/ORFs are again used to design a second pair of 

primers which flank the region of H. pylori DNA to be either interrupted or deleted (up 
to 250 basepairs) within the ORFs but are oriented away from each other. The pool of 



^tNSDOCID: <\NQ 



981S3P3A1 I > 



wo 98/18323 



FCT/US97/19575 



-83- 

circular plasmid DNAs of the previously isolated clones are used as templates for this 
round of PGR. Since the orientation of amplification of this pair of deletion primers is 
away from each other, the portion of the ORF between the primers is not included in the 
resultant PGR product. The PGR product is a linear piece of DNA with K pylori DNA 
5 at each end and the pT7Blue vector backbone between them which, in essence, resultes 
in the deletion of a portion of the ORFs. The PGR product is visualized on a 1% TAE, 
ethidium bromide stained agarose gel to confirm that only a single product of the correct 
size has been amplified. 

A Kanamycin-resistance cassette (Labigne-Roussel et al., 1988 J. Bacteriology 

10 1 70, 1 704- 1 708) is ligated to this PGR product by the TA cloning method used 
^reviou^lyn[Currenf Protocols irTMoleculaP Biology, John Wiley and Sons, Inc., F. 
Ausubel et al., editors, 1994). The Kanamycin cassette containing a Campylobacter 
kanamycin resistance gene is obtained by carrying out an EcoRI digestion of the 
recombinant plasmid pCT^%\kan (Gover et al.,1994, J. Biological Ghemistry 269, pp. 

1 5 1 0566- 1 0573). The proper fragment ( 1 .4 kb) is isolated on a 1 % TAE gel, and isolated 
using the QIAquick gel extraction kit (Qiagen, Gaithersburg, MD, USA). The fragment 
is end repaired using the Klenow fill-in protocol, which involved mixing 4ug of the 
DNA fragment, 1 microliter of dATP,dGTP, dGTP, dTTP at 0.5 mM, 2 microliter of 
Klenow Buffer (New England Biolabs) and 5 units of Klenow DNA Polymerase I Large 

20 (Klenow) Fragment (New England Biolabs) into a 20 microliter reaction, incubating at 
30°G for 15 min, and inactivating the enzyme by heating to 75°G for 10 minutes. This 
blunt-ended Kanamycin cassette is then purified through a Qiaquick column (Qiagen, 
Gaithersburg, MD, USA) to eliminate nucleotides. The "T" overhang is then generated 
by mixing 5 micrograms of the blunt-ended kanamycin cassette, 10 mM Tris pH 8.3, 50 

25 mM KGl, 2 mM MgGl2, 5 units of DNA Polymerase (Amplitaq, Roche Molecular 

Systems, Inc., Branchburg, NJ, USA), 20 microliters of 5 mM dTTP, in a 100 microliter 
reaction and incubating the reaction for 2 hours at 37°G. The "Kan-T" cassette is 
purified using a QIAquick column (Qiagen, Gaithersburg, MD, USA). The PGR 
product of the deletion primers (F2 and R2) is ligated to the Kan-T cassette by mixing 

30 10 to 25 ng of deletion primer PGR product, 50 - 75 ng Kan-T cassette DNA, 1 

microliter lOx T4 DNA Ligase reaction mixture, 0.5 microliter T4 DNA Ligase (New 
England Biolabs, Beverly, MA, USA) in a 10 microliter reaction and incubating for 16 
hours at 16**G. 

The ligation products are transformed into XL-1 Blue or DH5-a E.coli cells by 
35 electroporadon as described previously. After recovery in SOG, cells are plated onto LB 
plates containing 100 microgram/ml Ampicillin and grown ovemight at 37^G. These 
plates are then replica plated onto plates containing 25 microgram/ml Kanamycin and 
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allowed to grow overnight. Resultant colonies have both the Ampicillin resistance gene 
present in the pT7Blue vector, and the newly introduced Kanamycin resistance gene. 
Colonies are picked into LB containing 25 microgram/ml Kanamycin and plasmid DNA 
is isolated from the cultured cells using the Qiagen miniprep protocol (Qiagen, 

5 Gaithersburg, MD. USA). 

Several tests by PGR amplification are conducted on these plasmids to verify that 
the Kanamycin is inserted in the H. pylori gene/ORF, and to determine the orientation of 
the insertion of the Kanamycin-resistance gene relative to the H. pylori gene/ORF. To 
verify that the Kanamycin cassette is inserted into the H. pylori sequence, the plasmid 

1 0 DN As are used as templates for PGR amplification with the set of primers originally 
used to clone the H. pylori gene/ORFs. The correct PGR product is the size of the 
deleted gene/ORF but increased in size by the addition of a 1.4 kilobase Kanamycin 
cassette. To avoid potential polar effects of the kanamycin resistance cassette on H. 
pylori gene expression, the orientation of the Kanamycin resistance gene with respect to 

1 5 the knock-out gene/ORF is determined and both orientations are eventually used in H. 
pylori transformations (see below). To determine the orientation of insertion of the 
kanamycin resistance gene, primers are designed from the ends of the kanamycin 
resistance gene ("Kan-1" 5'-ATGTTAGGTATGAGGTGAAAT-3' (SEQ ID NO:207)), 
and "Kan-2" 5'-AGAGAGGAAGATGTTTGTGAA-3' (SEQ ID NO:208)). By using 

20 each of the cloning primers in conjunction with each of the Kan primers (4 combinations 
of primers), the orientation of the Kanamycin cassette relative to the H.pylori sequence 
is determined. Positive clones are classified as either in the "A" orientation (the same 
direction of transcription is present for both the H. pylori gene and the Kanamycin 
resistance gene), or in the "B" orientation (the direction of transcription for the H.pylori 

25 gene is opposite to that of the Kanamycin resistance gene). Clones which share the 
same orientation (A or B) are pooled for subsequent experiments and independently 
transformed into H. pylori. 

Transformation of Plasmid DNA into H. pylori cells 

30 Two strains of//, pylori are used for transformation: ATCC 55679, the clinical 

isolate which provided the DNA from which the H. pylori sequence database is 
obtained, and AH244, an isolate which had been passaged in, and has the ability to 
colonize the mouse stomach. Cells for transformation are grown at 37**C, 10% GO2, 
100% humidity, either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells are 

35 grown to exponential phase, and examined microscopically to determine that the cells 

are "healthy" (actively moving cells) and not contaminated. If grown on plates, cells are 
harvested by scraping cells from the plate with a sterile loop, suspended in 1 ml of 
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Brucella Broth, spun down (1 minute, top speed in eppendorf microfiige) and 
resuspended in 200 microliters Brucella Broth. If grown in Brucella Broth liquid, cells 
are centrifuged (15 minutes at 3000 rpm in a Beckman TJ6 centrifuge) and the cell pellet 
resuspended in 200 microliters of Brucella broth. An aliquot of cells is taken to 
5 determine the optical density at 600 nm, in order to calculate the concentration of cells. 
An aliquot (1 to 5 ODeoo units/25 microliter) of the resuspended cells is placed onto a 
prewarmed Sheep-Blood agar plate, and the plate is further incubated at 37^C, 6% CO2, 
100% humidity for 4 hours. After this incubation, 10 microliters of plasmid DNA (100 
micrograms per microliter) is spotted onto these cells. A positive control (plasmid DNA 
10 with the ribonuclease H gene disrupted by kanamycin resistance gene) and a negative 
control (no plasmid'DN A) are done in"paraiiel. "Th^ 

for an additional 4 hours of incubation. Cells are then spread onto that plate using a 
swab wetted in Brucella broth, and grown for 20 hours at 3 7*^0, 6% CO2. Cells are then 
transferred to a Sheep-Blood agar plate containing 25 micrograms/ml Kanamycin, and 

15 allowed to grow for 3 to 5 days at 37"C, 6% CO2, 100% humidity. If colonies appear, 
they are picked and regrown as patches on a fresh Sheep-Blood agar plate containing 25 
micrograms/ml Kanamycin. 

Three sets of PCR tests are done to verify that the colonies of transformants have 
arisen from homologous recombination at the proper chromosomal location. The 

20 template for PCR (DNA from the colony) is obtained by a rapid boiling DNA 

preparation method as follows. An aliquot of the colony (stab of the colony with a 
toothpick) is introduced into 100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5, 
and boiled for 6 minutes. An equal volume of phenol : chloroform (1 :1) is added and 
vortexed. The mixture is microfuged for 5 minutes and the supernatant is used as DNA 

25 template for PCR with combinations of the following primers to verify homologous 
recombination at the proper chromosomal location. 

TEST 1. PCR with cloning primers originally used to amplify the gene/ORF. A 
positive result of homologous recombination at the correct chromosomal location should 
show a single PCR product whose size is expected to be the size of the deleted 

30 gene/ORF but increased in size by the addition of a 1 .4 kilobase Kanamycin cassette. A 
PCR product of just the size of the gene/ORF is proof that the gene had not been 
knocked out and that the transformant is not the result of homologous recombination at 
the correct chromosome location. 

TEST 2. PCR with F3 (primer designed from sequences upstream of the 

35 gene/ORF and not present on the plasmid), and either primer Kan-1 or Kan-2 (primers 
designed from the ends of the kanamycin resistance gene), depending on whether the 
plasmid DNA used was of "A" or "B" orientation. Homologous recombination at the 
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correct chromosomal location will result in a single PGR product of the expected size 
(i.e., from the location of F3 to the insertion site of kanamycin resistance gene). No 
PGR product or PGR product(s) of incorrect size(s) will prove that the plasmid had not 
integrated at the correct site and that the gene had not been knocked out. 

5 TEST 3. PGR with R3 (primer designed from sequences downstream of the 

gene/ORF and not present on the plasmid) and either primer Kan- 1 or Kan-2, depending 
on whether the plasmid DNA used was of "A" or "B" orientation. Homologous 
recombination at the correct chromosomal location will result in a single PGR product of 
the expected size (i.e., from the insertion site of kanamycin resistance gene to the 

1 0 downstream location of R3). Again, no PGR product or PGR product(s) of incorrect 

size(s) will prove that the plasmid had not integrated at the correct site and that the gene 
had not been knocked out. 

Transformants showing positive results for all three tests above indicate that the 

gene is not essential for survival in vitro. 

1 5 A negative result in any of the three above tests for each transformant indicates 

that the gene had not been disrupted, and that the gene is essential for survival in vitro. 

In the event that no colonies result from two independent transformations while 
the positive control with the disrupted ribonuclease H plasmid DNA produces 
transformants, the plasmid DNA is further analyzed by PGR on DNA from transformant 

20 populations prior to plating for colony formation. This will verify that the plasmid can 
enter the cells and undergo homologous recombination at the correct site. Briefly, 
plasmid DNA is incubated according to the transformation protocol described above. 
DNA is extracted from the H. pylori cells immediately after incubation with the plasmid 
DNAs and the DNA is used as template for the above TEST 2 and TEST 3. Positive 

25 results in TEST 2 and TEST 3 would verify that the plasmid DNA could enter the cells 
and undergo homologous recombination at the correct chromosomal location. If TEST 
2 and TEST 3 are positive, then failure to obtain viable transformants indicates that the 
gene is essential, and cells suffering a disruption in that gene are incapable of colony 
formation 



30 



VII. Hieh-throughput drug screen assay 
Cloning, expression and protein purification 

Gloning, transformation, expression and purification of the H. pylori target gene 
and its protein product,e.g., an H. pylori enzyme, to be used in a high-throughput drug 
35 screen assay, is carried out essentially as described in Examples II and III above. 
Development and application of a screening assay for a particular H. pylori gene 
product, peptidyl-propyl cis-trans isomerase, is described below as a specific example. 
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Enzymatic Assay 

The assay is essentially as described by Fisher (Fischer, G., et.al, (1984) Biomed 
Biochim, ^c/a 43 : 1 1 0 1 - 1 1 1 1 ). The assay measures the cis-trans isomerization of the 
Ala-Pro bond in the test peptide N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide (Sigma # S- 
5 7388, lot # 84H5805). The assay is coupled with a-chymotrypsin, where the ability of 
the protease to cleave the test peptide occurs only when the Ala-Pro bond is in trans. 
The conversion of the test peptide to the trans isomer in the assay is followed at 390 nm 
on a Beckman Model DU-650 spectophotometer. The data are collected every second 
with an average scanning of time of 0.5 second. Assays are carried out in 35 mM 
10 Hepes, pH 8.0, in a final volume of 400 ul, with 10 ^iM a-chymotrypsin (type 1-5 from 
'bovine-pancreasrSigma-#-e=7762rlot^3H7020)-and-rG-nM-pprase— ^ 
reaction, 10 \i\ of the substrate ( 2 mM N-Succinyl-Ala-Ala-Pro-Phe-p-nitroanilide in 
DMSO) is added to 390 \x\ of reaction mixture at room temperature. 

1 5 Enzymatic assay in crude bacterial extract, 

A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth is harvested 
at mid-log phase (OD 500 nm ^ ^ ) resuspended in lysis buffer with the following 
protease inhibitors: 1 mM PMSF, and 10 |ag/ml of each of aprotinin, leupeptin, 
pepstatine, TLCK, TPCK, and soybean trypsin inhibitor. The suspension is subjected to 

20 3 cycles of freeze-thaw (15 minutes at -70 ° C, then 30 minutes at room temperature), 
followed by sonication (three 20 second bursts). The lysate is centrifuged (12,000 g x 
30 minutes) and the supernatant is assayed for enzymatic activity as described above. 

Many //. pylori enzymes can be expressed at high levels and in an active form in 
E, coll. Such high yields of purified proteins provide for the design of various high 

25 throughput drug screening assays. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments and methods 
30 described herein. Such equivalents are intended to be encompassed by the scope of the 
following claims. 
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SEQUENCE LISTING 



1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: Astra Aktiebolag 

(B) STREET: S-151 85 

(C) CITY: Sodertalje 
10 (D) STATE: 

{ E ) COUNTRY : Sweden 
(F) POSTAL CODE (ZIP) 

(ii) TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES 

J 5 RELATING TO HELICOBACTER PYLORI AND 

VACCINE COMPOSITIONS THEREOF 



20 



30 



50 



55 



(iii) NUMBER OF SEQUENCES: 208 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 
25 (D) SOFTWARE: 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER 

(B) FILING DATE: 



(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/739,150 

(B) FILING DATE: 28-OCT-1996 



35 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/759,739 

(B) FILING DATE: 06-DEC-1996 

(viii) PRIOR APPLICATION DATA: 
40 (A) APPLICATION NUMBER: US 08/891,928 

(B) FILING DATE: 14-JULY-1997 

(ix) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

45 (B) STREET: 28 State Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109-1875 



(x) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Mandragouras , Amy E. 

(B) REGISTRATION NUMBER: 3 6,2 07 

(C) REFERENCE /DOCKET NUMBER: GTN- OOlCPlOPC 



SNSDOCID: <WO 981S323A1_I_> 
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(xi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)742-4214 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc__f eature 

(B) LOCATION 1. . .561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGATTAAAA GAATTGCTTG TATTTTAAGC TTGAGCGCGA GTTTAGCGTT AGCTGGCGAA 60 

GTGAATGGGT TTTTCATGGG TGCGGGTTAT CAACAAGGTC GTTATGGCCC TTATAACAGC 120 

AATTACTCTG ATTGGCGTCA TGGCAATGAC CTTTATGGTT TGAATTTCAA ATTAGGTTTT 180 

GTAGGCTTTG CCAATAAATG GTTTGGGGCT AGGGTGTATG GCTTTTTAGA TTGGTTTAAC 24 0 

ACTTCAGGGA CTGAACACAC CAAAACCAAT TTGCTCACCT ATGGCGGCGG TGGCGATTTG 3 00 

ATTGTCAATC TCATTCCTTT GGATAAATTC GCTCTAGGTC TCATTGGTGG CGTTCAATTA 360 

GCCGGAAACA CTTGGATGTT CCCTTATGAT GTCAATCAAA CCAGATTCCA GTTCTTATGG 420 

AATTTAGGCG GAAGAATGCG TGTTGGGGAT CGCAGTGCGT TTGAAGCGGG CGTGAAATTC 4 80 

CCTATGGTTA ATCAGGGTAG CAAAGATGTA GGGCTTATCC GCTACTATTC TTGGTATGTG 54 0 

GATTATGTCT TCACTTTCTA G 561 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 51 base pairs 

(B) TYPE ; nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...351 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

TTGATGCGCA TTATCATAAG GTTACTTTCA TTTAAAATGA ACGCTTTTTT AAAACTCGCG 60 

CTCGCTTCTT TGATGGGGGG GCTTTGGTAT GCTTTCAATG GCGAAGGCTC TGAGATTGTC 120 

GCTATAGGGA TTTTTGTGTT GATCTTGTTT GTTTTTTTTA TCCGCCCTGT GAGTTTCCAA 180 

10 GACCCAGAAA AACGAGAAGA ATACATAGAA CGGCTTAAAA AAAACCATGA GAGGAAAATG 240 

ATCTTACAAG ACAAGC7VAAA AGAAGAGCAA ATGCGCCTCT ATCAAGCCAA AAAAGAGCGA 300 

GAGAGCAGGC AAAAACAAGA CCTTAAAGAA CAAATGAAAA AATACTCATA A 351 



15 



25 



30 



(2) INFORMATION FOR SEQ ID NO : 3 ; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1038 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGGTTAAAC ACTATCTTTT CATGGCGGTT TCGCAGGTCT TTTTCTCCTT CTTTTTAGTG 60 

CTGTTTTTTA TCTCTTCCAT TGTGTTATTA ATCAGTATTG CAAGCGTAAC GCTCGTGATT 120 

AAAGTGAGCT TTTTGGATCT GGTGCAACTC TTTTTGTATT CCTTGCCAGG AACCATTTTT 180 

40 TTTATTTTGC CGATCACTTT TTTTGCGGCT TGCGCTTTGG GGCTTTCAAG GCTTAGCTAT 240 

GACCATGAAT TGTTAGTGTT TTTCTCTTTA GGGGTTTCGC CTAAAAAAAT GACTAAAGCG 3 00 

TTTGTGCCTT TAAGTTTGTT AGTGAGCGCG ATTTTATTAG CGTTTTCGCT CATCTTAATC 360 

CCCACTTCTA AGAGCGCTTA TTACGGGTTT TTGCGTCAAA AAAAAGACAA GATTGACATT 42 0 

AACATCAGAG CGGGTGAATT CGGGCAAAAA TTAGGCGATT GGCTCGTGTA TGTGGATAAG 48 0 

45 ACTGAAAACA ATTCCTATGA TAATTTGGTG CTTTTTTCTA ATAAAAGTCT CTCTCAAGAA 540 

AGCTTTATTT TGGCTCAAAA AGGCAATATC AACAATCAAA ACGGCGTGTT TGAATTGAAT 600 

TTGTATAACG GGCATGCGTA TTTCACTCAA GGCGATAAAA TGCGTAAGGT TGATTTTGAA 660 

GAATTGCATT TGCGCAACAA GCTCJVAGTCT TTCAATTCTA ATGATGCGGC TTATTTGCAA 720 

GGCACGGATT ATTTGGGTTA TTGGAAAAAA GCCTTTGGTA AAAACGCTAA TAAAAATCAA 780 

50 AAACGCCGTT TTTCTCAAGC GATCTTAGTT TCCTTGTTCC CTTTAGCGAG CGTGTTTTTA 840 

ATCCCCTTAT TTGGCATCGC CAACCCGCGA TTCAAAACGA ATTGGAGTTA TTTCTATGTC 900 

CTTGGAGCGG TTGGGGTTTA TTTTTTAATG GTGCATGTGA TTTCTACGGA TTTGTTTTTG 960 

ATGACCTTTT TCTTCCCCTT TATTTGGGCG TTTATTTCTT ATTTATTGTT TAGAAAATTC 102 0 

ATTTTAAAGC GTTATTAA 1038 

55 
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(2) INFORMATION FOR SEQ ID NO ; 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: JHelicLobacter ^pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

ATGAAGAAAA AAGCAAAAGT CTTTTGGTGT TGTTTTAAAA 

GCGGTCTTTT TTTTGTTGAG CGTATCAGAC GCTAAAGAAA 

AAACAAAACC ATAAGATTTT TGAAATCCTT GCGGATAAAG 

ATAACCGCCT CAGGGAATGC GATCCTATTG AATTATGACG 

GTGCGTTATG ACACCAAGAC TAAAGAAGCG TTATTAGAAG 

GGCGAGGGCT TGCTCGTTAA AACCGATTAT GTGAAATTGA 

ATCATTTTCC CCTTTTATGT CCT^GACAGC GTGAGCGGGA 

GCTAGCGGGA AGGATCAAAA ATATAAGATT AAAAACATGA 

GACAACCCCA TTTGGCATGT CAATGCGACT TCAGGCTCAT 

TTGTCAATGT GGAATCCTAA GATTTATGTC GGCGATATTC 

ATTTTCATGT CCACGAGCAA TAAAAGAACT ACCGGGTTTT 

TCCAACTTAG ACGGCTTTAT TTATTTGCAA CCCTTTTATT 

GATATGACCT TTACCCCACA AATCCGTTAC AAAAGGGGTT 

CGCTACATCA ACTCTAAGAC GCAGGTTTTT ATTCAATGCG 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETI CAL : NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



TGATTCGTTG 


GTTGTATTTG 


60 


TCGCTATGCA 


ACGATTTGAC 


120 


TGAGCGCCAA 


AGACAATGTG 


180 


TGTATATTCT 


AGCGGATAAG 


240 


GCAATATTAA 


GGTTTATAGG 


300 


GTTTGAACGA 


AAAATATGAG 


360 


TTTGGGTGAG 


CGCGGATATT 


420 


GCGCTTCAGG 


GTGCAGCATT 


480 


TTAACATGCA 


AAAATCGCAT 


540 


CTGTATTGTA 


TTTGCCCTAT 


600 


TATACCCTGA 


GTTTGGCACT 


660 


TAGCCCCCAA 


AAACTCATGG 


720 


TTGGCTTGAA 


TTTTGAAGCG 


780 


CGCTATTTTA 


G 


831 
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(ix) FEATURE: 

(A) NAME/KEY : misc__f eature 

(B) LOCATION 1,..675 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATGATTAGAT TAAAAGGTTT GAATAAAACT TTAAAAACAA GCTTATTAGC TGGGGTTTTA 60 

CTAGGTGCTA CTGCTCCCTT AATGGCAAAG CCTTTATTAA GCGATGAAGA CTTATTGAAA 120 

CGAGTAAAAC TACACAATAT CAAAGAAGAT ACGCTGACTA GCTGTAATGC TAAGGTGGAC 180 

10 GGCTCTCAAT ACTTGAATAG TGGTTGGAAT TTATCTAAAG AATTTCCGCA AGAATATAGA 24 0 

GAAAAGATTT TTGAATGCGT AGAAGAAGAA AAACATAAAC AAGCCCTTAA TTTAATCAAT 3 00 

AAAGAAGACA CTAAAGATAA AGAAGAACTT GCAAAAAAAA TCAAAGAAAT TAAAGAAAAA 3 60 

GCTAAAGTTT TAAGGCAAAA ATTTATGGCT TTTGAAATGA AAGAACACTC TAAAGAATTC 420 

CCAAATAAAA AGCAACTTCA AACCATGCTT GAGAACGCTT TTGATAATGG AGCTGAAAGT 4 80 

15 TTTATTGATG ATTGGCACGA ACGCTTTGGG GGTATAAGTA GAGAGAATAC TTATAAAGCA 54 0 

CTTGGCATTA AAGAATATAG TGATGAAGGA AAGATATTGC CTTTGGCGAA AGAAGTTATA 600 

TTAGACAATA TAAAAAAGAT TTTGAAGAAA GCACTTATGA TACTAGACAA CCCTTATCTG 66 0 

CTATGGCTAG TATGA ^"^^ 

20 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...1290 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 ; 



ATGCCATACG CCTTAAGAAA AAGATTTTTC AAACGCCTTT TATTGTTTTT TTTAATTGTT 60 

TGTATGATAA ATTTGCATGC CAAAAGCTAT CTGTTTTCTC CTTTGCCCCC AGCGCACCAG 120 

45 CAAATCATTA AGACAGAGCC TTGCTCTTTG GAGTGCTTGA AAGACTTGAT GCTGCAAAAT 180 

CAAATCTTTT CTTTTGTATC CCAATACGAT GATAACAACC AAGATGAGAG CCTTAAAACT 240 

TATTACAAGG ACATCTTAAA CAAACTCAAC CCCGTATTCA TCGCTTCTCA AACTCCAGCT 300 

AAAGAAAGCT ATGAGCCTAA GATTGAATTA GCGATTTTAC TGCCTAAAAA GGTGGTGGGC 360 

CGTTATGCGA TTTTAGTGAT GAACACCCTT TTAGCGTATT TGAACACCAG AAACAACGAT 42 0 

50 TTCAATATCC AAGTCTTTGA CAGCGATGAA GAAAGCCCTG AAAAATTAGA AGAAACCTAT 480 

AAAGAAATTG AAAAAGAAAA ATTCCCTTTT ATCATCGCTT TATTGACTAA AGAGGGCGTG 540 

GAAAATTTGC TCCAAAATAC GACTATCAAT ACCCCTACTT ATGTGCCTAC GGTGAATAAA 600 

ACGCAATTAG AAAATCATAC CGAGCTTTCT TTAAGCGAGC GCTTGTATTT TGGGGGGATT 660 

GATTATAAAG AGCAATTAGG CATGCTCGCA ACTTTCATTA GCCCTAATTC GCCCGTGATT 720 

55 GAATACGATG ATGATGGCCT GATAGGTGAA CGCTTGAGGC AAATCACGGA GTCTTTAAAC 780 
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GTTGAAGTCA AACACCAAGA AT^CATTTCT TACAAACAAG CGACCAGTTT TTCTAAAAAT 840 

TTTAGAAAAC ATGATGCGTT TTTTAAAAAT TCTACCTTAA TTTTGAACAC CCCTACCACT ,900 

AAAAGCGGTC TGATCCTTTC TCAAATAGGG CTTTTAGAGT ATAAGCCTCT TAAAATCCTT 960 

TCCACACAAA TCAATTTCAA CCCCTCTTTA CTCTTGCTCA CCCAGCCTAA AGACAGGAAA 1020 

AATTTATTCA TTGTCAATGC CTTGCAAAAC AGCGATGAAA CGCTGATAGA ATACGCTTCC 1080 

TTATTAGAGA GCGATTTAAG GCATGATTGG GTGJ^TTATT CCAGCGCGAT AGGGCTAGAG 1140 

ATGTTTTTAA ACACGCTAGA TCCGCATTTT AAAAAGTCTT TTCAAGAGAG TTTGGAAGAC 12 00 

AATCAAGTCC GTTACCACAA TCAAATTTAT CAGGCTTTAG GGTATTCTTT TGAGCCGATA 1260 

AAAAACGAAA GCGTUVACAAA ATIAAGAATAA 12 90 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 6 8 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1368 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GTGTTAAAAT TTCAAAAATT ACCCTTATTG TTTGTTTCCA TTCTTTATAA TCAAAGCCCT 6 0 

TTATTGGCTT TTGATTATAA GTTTAGTGGG GTAGCGGAAT CTGTTTCTAA AGTGGGGTTT 12 0 

AACCATTCCA AACTCAATTC CAAAGAAGGG ATTTTCCCTA CAGCCACCTT TGTAACCGCC 18 0 

ACGATCAAGC TTCAAGTGGA TTCCAATCTG CTCCCTAAAA ACATTGT^AAA ACACAGCTTA 240 

AAAATAGGCG TTGGGGGGAT TTTAGGAGCG CTCGCTTACG ATTCCACCAA AACGCTCATA 300 

GACCAAGCCA CGCATCAAAT CTATGGCTCA GAACTTTTTT ACCTCATAGG GCGTTGGTGG 360 

GGGTTTTTAG GCAACGCTCC TTGGAAAGAC TCCCTCATAG AATCTGACGC TCACACCCGT 42 0 

AATTATGTGC TGTATAATTC CTATCTGTTT TATTCTTATG GCGATAAATT CCACCTAAAA 480 

TTAGGGCGTT ATCTCTCTAA CATGGATTTT ATGAGTTCCT ACACACAGGG TTTTGAACTG 54 0 

GATTATAAAA TCAATTCTAA AATAGCGTTA AAATGGTTTA GCTCTTTTGG GAGGGCGTTG 6 00 

GCTTTTGGGC AATGGATACG GGATTGGTAT GCCCCTATTG TAACTGAAGA TGGCAGAAAA 660 

GAAGTTTATG ATGGCATCCA TGCCGCGCAA CTCTATTTTT CTAGCAAGCA TGTTCAAGTC 720 

ATGCCTTTTG CTTATTTTTC GCCTAAGATT TACGGAGCGC CCGGTGTTAA AATCCATATT 780 

GATAGCAACC CGAAATTCAA AGGCTTAGGG TTAAGGGCTC AAACCACTAT TAATGTGATT 84 0 

TTCCCTGTTT ATGCTAAAGA TTTATACGAT GTGTATTGGC GTAACTCTAA GATTGGCGAG 900 

TGGGGCGCAT CGCTTTTGAT CCACCAACGC TTTGACTACA ACGAATTTAA CTTTGGCTTT 960 

GGTTATTACC AAAATTTTGG CAACGCTAAC GCAAGGATTG GCTGGTATGG TAACCCCATC 1020 

CCTTTTAATT ATAGAAATAA CAGCGTTTAT GGTGGGGTCT TCAGTAACGC TATTACCGCA 108 0 

GACGCCGTTT CTGGGTATGT CTTTGGTGGG GGGGTGTATA GAGGGTTTTT ATGGGGTATT 114 0 

TTAGGCAGAT ACACTTATGC CACTAGAGCG AGCGAAAGAT CCATCAACTT GAACTTGGGC 1200 

TATAAATGGG GTTCTTTTGC TAGAGTTGAT GTGAATTTAG AATACTATGT GGTCAGCATG 1260 

CACAACGGCT ATAGATTAGA CTATCTCACC GGCCCTTTCA ACAAAGCCTT TAAGGCTGAC 1320 
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GCACAAGATA GGAGTAACCT TATGGTTAGC ATGAAATTCT TTTTTTAA 1368 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...849 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

ATGGGGTGTT CGTTTATCTT TAAAAAAGTT AGGGTTTATT CTAAAATGTT GGTTGCTTTG 
GGGCTTTCAA GCGTGTTGAT CGGTTGCGCG ATGAATCCAA GCGCTGAGAC AAAAAAACCA 
AATGACGCCA AAAACCAACA ACCAGTTCAA ACTCATGAAA GAATGACAAC AAGTTCTGAA 
CATGTTACGC CACTAGATTT TAATTACCCG GTGCATATTG TTCAAGCCCC ACAAAACCAT 
CATGTTGTAG GTATTTTAAT GCCACGCATT CAAGTGAGCG ATAATCTAAA ACCCTATATT 
GATAAGTTTC AAGACGCTTT AATTAATCAA ATCCAAACTA TTTTTGAAAA AAGAGGCTAT 
CAAGTGTTGC GTTTTCAAGA TGAAAAAGCT TTGAATGTGC AAGATAAGAA AAAGATTTTT 
TCCGTTTTGG ATTTGAAAGG GTGGGTAGGA ATCTTAGAAG ATTTGAAAAT GAATTTAAAA 
GATCCCAATA GTCCCAATTT AGACACGCTA GTGGATCAAA GCTCAGGCTC TGTATGGTTT 
^^^^TTATG AACCAGAAAG CAATCGTGTC GTCCATGATT TTGCTGTAGA AGTAGGAACT 
TTTCAGGCAA TAACATACAC ATACACCTCT ACTAATAACG CTTCAGGAGG GTTTAATTCT 
TCAAAAAGCG TTATCCATGA AAATTTGGAT AAGAATAGAG AAGACGCGAT ACACAAGATT 
TTAAACAGAA TGTATGCGGT TGTCATGAAA AAAGCTGTAA CAGAACTTAC AAAAGAAAAT 
ATCGCCAAAT ACAGAGACGC TATTGATAGA ATGAAAGGCT TTAAAAGTTC TATGCCTCAA 
AAAAAGTAG 

(2) INFORMATION FOR SEQ ID NO : 9 : 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
849 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 84 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL; NO 



(iv) ANTI- SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc__f eature 

(B) LOCATION 1 . . . 843 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGAAACTGA GAGCAAGTGT TTTAATCGGT GTGGCAATTC TGTGCTTAAT TTTAAGTGCG 60 

TGCAGTAACT ATGCGAAAAA AGTGGTGAAA CAAAAGAACC ATGTTTATAC GCCTGTGTAT 12 0 

AATGAACTGA TAGAGAAGTA TAGTGAGATC CCCTTAAATG ACAAACTCAA AGACACACCA 18 0 

TTCATGGTGC AAGTGAAGTT GCCAAATTAC AAGGACTATT TGTTGGATAA TAAACAAGTT 24 0 

GTACTAACTT TCAAACTTGT TCACCATTCT AAAAAGATTA CGCTCATAGG CGATGCCAAT 3 00 

AAGATCCTCC AATACAAGAA TTACTTCCAA GCTAACGGGG CAAGATCTGA CATTGATTTT 360 
TACTTGCAAC CCACTTTGAA TCAAAAGGGT GTGG TGATGA _TAGCGAGTAA_CTACAATGAT 2.0 

AATCCCAACA ACAAAGAAAA ACCACAGACC TTTGATGTGT TGCAAGGAAG TCAGCCAATG 4 80 

CTAGGAGCTA ACACAAAAAA CTTGCATGGC TATGATGTGA GTGGAGCAAA CAACAAGCAA 54 0 

GTGATCAATG AAGTGGCAAG AGAAAAAGCT CAGCTAGAAA AAATCAATCA GTATTACAAG 6 00 

ACTCTCTTGC AAGACAAGGA ACAAGAATAT ACCACTAGGA AAAATAACCA ACGAGAAATT 660 

TTAGAAACAT TGAGTAATCG TGCAGGTTAT CAAATGAGGC AGAATGTGAT TAGTTCTGAG 720 

ATTTTTAAGA ATGGCAACTT GAACATGCAA GCCAAAGAAG AAGAAGTTAG GGAGAAGCTA 780 

CAAGAAGAAA GAGAGAATGA ATACTTGCGC AATCAAATCA GAAGTTTGCT CAGTGGTAAG 84 0 

TGA 843 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .1179 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

ATGAGAAAAC TATTCATCCC ACTTTTATTA TTCAGCGCTT TAGAAGCGAA CGAGAAAAAC 60 

GGCTTTTTCA TAGAAGCCGG CTTTGAAACT GGGCTATTAG AAGGCACACA AACGCAAGAA 12 0 

AAAAGACACA CCACCACAAA AAACACTTAC GCAACTTACA ATTATTTACC CACAGACACG 18 0 

ATTTTAAAAA GAGCGGCTAA TTTATTCACC AATGCCGAAG CGATTTCAAA ATTAAAATTC 24 0 

TCATCTTTAT CCCCTGTTAG AGTGTTGTAT ATGTATAATG GTCAATTAAC TATAGAAAAC 300 

TTCTTGCCTT ATAATTTAAA TAATGTTAAG CTTAGTTTTA CAGACGCTCA AGGCAATGTG 360 

ATCGATCTAG GCGTGATAGA -GACTATCCCC AAACACTCTA AGATTGTTTT GCCCGGAGAG 42 0 
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GCATTTGATA GTCTAAAAAT TGACCCCTAT ACTTTATTTC TTCCAAAAAT TGAAGCCACT 480 

AGCACTTCTA TTTCTGACGC TAACACGCAG AGGGTGTTTG AAACGCTCAA TAAGATTAAG 54 0 

ACAAATTTGG TCGTAAATTA TAGGAATGAA AACAAATTTA AAGATCACGA AAATCATTGG 600 

GAAGCCTTTA CCCCACAAAC CGCAGAAGAA TTCACTAATT TAATGTTGAA CATGATCGCT 660 

5 GTTTTAGACT CCCAATCTTG GGGCGATGCG ATCTTAAACG CTCCTTTTGA GTTCACTAAC 720 

AGCCCAACAG ATTGCGATAA TGATCCTTCA AAATGCGTAA ATCCTGGGAC AAACGGGCTT 780 

GTCAATTCTA AAGTCGATCA AAAATATGTG TTAAACAAAC AAGACATTGT CAATAAATTT 840 

AAAAACAAAG CGGATCTTGA TGTAATTGTT TTAAAGGATT CAGGGGTTGT AGGGCTTGGG 900 

AGTGATATTA CCCCTAGCAA CAATGATGAT GGCAAGCATT ATGGCCAGTT AGGGGTAGTA 960 

10 GCTTCTGCTT TAGATCCTAA AAAACTCTTT GGCGATAACC TTAAGACTAT CAATTTAGAG 1020 

GATTTAAGAA CCATCTTGCA TGAATTCAGC CACACTAAAG GCTATGGGCA TAACGGGAAT 1080 

ATGACCTATC AAAGAGTGCC GGTAACGAAA GATGGTCAAG TGGAAAAGGA TAGTAATGGC 1140 

AAGCCAAAAG ATTCTGATGG CCTCCCCTAT AATGTGTGT 1179 

15 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...813 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATGAAAAAGT TTGTAGCTTT AGGGCTTCTA TCCGCGGTTT TAAGCTCTTC GTTGTTAGCC 60 

GAAGGTGATG GTGTTTATAT AGGGACTAAT TATCAGCTTG GACAAGCCCG TTTGAATAGC 120 

40 AATATTTATA ATACAGGGGA TTGCACAGGG AGTGTTGTAG GTTGCCCCCC AGGTCTTACC 180 

GCTAATAAGC ATAATCCAGG AGGCACCAAT ATCAATTGGC ACTCCAAATA CGCTAATGGG 240 

GCTTTGAATG GTTTTGGGTT GAATGTGGGT TATAAGAAAT TCTTCCAATT CAAGTCGCTA 3 00 

GATATGACAA GCAAGTGGTT TGGTTTTAGA GTGTATGGGC TTTTTGATTA CGGGCATGCC 360 

GATTTAGGTA AACAAGTTTA TGCACCTJ^T AAAATCCAGT TGGATATGGT CTCTTGGGGT 420 

45 GTGGGGAGCG ATTTGTTAGC TGATATTATT GATAAAGACA ACGCTTCTTT TGGTATTTTT 480 

GGTGGGGTCG CTATCGGCGG TAACACTTGG AAAAGCTCTG CAGCAAACTA TTGGAAAGAG 540 

CAAATCATTG AAGCCAAAGG TCCTGATGTT TGTACCCCTA CTTATTGTAA CCCTAATGCC 6 00 

CCTTATAGCA CCAACACTTC AACCGTCGCT TTTCAAGTGT GGTTGAATTT TGGGGTGAGA 66 0 

GCCAATATCT ACAAGCATAA TGGCGTGGAA TTTGGCGTGA GAGTGCCGCT ACTCATCAAT 720 

50 AAATTTTTGA GCGCGGGTCC TAACGCTACT AACCTTTATT ACCATTTGAA ACGGGATTAT 78 0 

TCGCTTTATT TGGGGTATAA CTACACTTTT TAA 81^ 

(2) INFORMATION FOR SEQ ID NO : 12 : 
55 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 423 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : c i r cular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(B) LOCATION 1. . .423 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATGCATCCTA TAATGTTTGC CTATATCGCT AACGCGCTCG CTCAAGCTAG AAAGATCAAC 6 0 

GGAACACTTT GCATGGCGTT TCAAAAAATA TCTCJ^AGTCA AAGAATTAGG CATTGATAAA 12 0 

GCAAAGAGTT TGATAGGCAA CCTTTCTCAA GTGATTATCT ACCCCACAAA AGATACTGAT 18 0 

GAATTAATAG AATGTGGCGT CCCATTAAGC GATAGTGAAA TCAATTTCTT ACACAACACG 240 

GACATGAGAG CCAGACAAGT GCTAGTAAAA AATATCGTTA CAAACGCTTC AGCTTTTATT 3 00 

GAAATTGATT TAAAAAAGAT TTGCAAGAAC TACTTTATAT TCTTGATAGC AATGCTGGTA 360 

ATAGAAAAAT CCTCAATGAT CTTAAAAAAG CAAACCAAGA AACTTATAAG GAAGAGTATT 420 



(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .771 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 ; 



ATGTTGGGGA GCGTCAAAAA AGCGGTTTTT AGGGTTTTGT GTTTGGGGGC GTTGTGTTTA 
TGCGGGGGGT TAATGGCAGA GCAAGATCCT AAAGAGCTTA TATTTTCAGG TATAACTATT 
TACACGGATA AAAATTTCAC TAGAGCTAAG AAATATTTTG AAAAAGCTTG CAAATCAAAC 



60 
120 
180 
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GATGCTGATG GCTGTGCAAT CTTAAGAGAG GTTTATTCTA GTGGTAAAGC CATAGCGAGA 240 

GAAAACGCAA GAGAGAGCAT TGAAAAAGCT CTTGAACACA CCGCTACTGC TAAAGTTTGT 3 00 

AAATTAAACG ATGCTGAAAA ATGCAAGGAC TTAGCAGAGT TTTATTTTAA TGTAAACGAT 360 

CTTAAAAATG CTTTAGAATA TTACTCTAAA TCTTGTAAGT TAAATAATGT TGAAGGGTGT 420 

ATGCTGTCAG CAACTTTTTA TAACGATATG ATAAAGGGTT TGAAAAAAGA TAAAAAAGAT 4 80 

CTAGAATATT ATTCTAAAGC TTGCGAGTTA AATAACGGTG GAGGGTGTTC TAAATTAGGA 540 

GGGGATTATT TTTTTGGTGA AGGCGTAACA AAAGATTTCA AAAAAGCTTT TGAATATTCT 6 00 

GCCAAAGCTT GTGAGTTGAA CGATGCTAAA GGGTGTTACG CTCTAGCAGC GTTTTATAAT 660 

GAGGGTAAAG GCGTGGCAAA GGATGAAAAG CAAACGACAG AAAACCTTGA AAAGAGTTGC 720 

AAGCTAGGAT TAAAAGAAGC ATGCGATATT CTCAAAGAAC AAAAACAATA A 771 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 9 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : c i r cul ar 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .729 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

ATGAAAAAAT TTTTTTCTCA ATCTTTGTTA GCTCTTATTA TCTCTATGAA TGCGGTATCT 60 

GGCATGGATG GTAATGGCGT TTTTTTAGGG GCGGGTTATT TGCAAGGACA GGCGCAAATG 120 

CATGCGGATA TTAATTCTCA AAAACAAGCC ACCAACGCTA CGATCAAAGG CTTTGACGCG 180 

CTCTTGGGGT ATCAATTTTT CTTTGAAAAA CACTTTGGCT TACGCCTTTA TGGGTTTTTT 240 

GACTACGCTC ATGCCAATTC TATTAAGCTT AAAAACCCTA ACTATAATAG CGAAGCGGCG 300 

CAAGTGGCTA GTCAAATTCT TGGGAAACAA GAAATCAATC GTTTAACAAA CATTGCCGAT 360 

CCCAGAACTT TTGAGCCGAA CATGCTCACT TATGGGGGGG CTATGGACGT GATGGTTAAT 420 

GTCATCAATA ACGGCATCAT GAGTTTGGGG GCTTTTGGCG GGATACAATT GGCCGGCAAT 4 80 

TCATGGCTTA TGGCGACACC GAGCTTTGAG GGCATTTTAG TGGAACAAGC CCTTGTGAGC 540 

AAGAAAGCCA CTTCTTTCCA ATTTTTATTC AATGTGGGGG CTCGCTTAAG GATCTTAAAA .600 

CATTCTAGCA TTGAAGCGGG CGTGAAATTC CCCATGCTAA AGAAAAACCC CTACATCACT 660 

GCAAAAAATT TGGATATAGG GTTTAGGCGC GTGTATTCGT GGTATGTGAA TTACGTGTTC 720 
ACTTTCTAG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



729 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1.,.804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



ATGAACTACC CTAATCTACC TAACAGCGCT TTAGAGATAA GCGAACAGCC AGAAGTGAAA 60 

GAAATCACTA ACGAGCTTTT AAAGCAATTA CAAAACGCTT TAAGGAGCAA CGCGCATTTT 120 

AGCGAGCAAG TGGAATTAAG CCTTAAATGC ATCGTTAGGA TTTTAGAAGT GCTTTTGAGT 18 0 

TTGGATTTTT TTAAGAATGC GAATGAGATT GATAGCAGTT TAAGAAATTC CATTGAGTGG 2 40 

CTGACTAACG CCGGCGAGAG CTTGAAATTA AAAATGAAAG AATACGAGCG CTTTTTTAGC 3 00 

GAGTTTAATA CGAGCATGCA TGCCAACGAG CAGGAAGTAA CCAATACCTT AAACGCTAAC 360 

GCCGAGAACA TTAAAAGCGA AATTAAAAAG CTAGAAAATC AATTGATAGA AACCACGACA 420 

AGACTTTTAA CGAGCTATCA AATCTTTTTA AACCAAGCCA GAGATAACGC TAACAACCAA 480 

ATCACAAAAA ACAAAACCCA AAGCCTTGAA GCGATTACAC AAGCTAAAAA CAACGCTAAT 54 0 

AATGAAATAA GCAACAATCA AACGCAAGCG ATAACTAATA TCACCGAAGC GAAAACGAAC 600 

GCTAATAATG AAATAAGCAA CAATCAAACG CAAGCGATAA CTAACATTAA CGAAGCCAAA 660 

GAAAGCGCTA CAACGCAAAT AAACGCCAAT AAGCAAGAAG CAATAAATAA CATCACGCAA 720 

GAAAAAACCC AAGCCACAAG CGAGATCACC GAAGCGAAAA AGACCGATCA TTATCAAAAC 780 
ATTGATTTTT TTGAGTTTGA ATAA . 8 04 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1632 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : doilble 

( D ) TO POLOG Y : c i r cul a r 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .1632 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GTGATAGAGA CCATCCCCAA ACACTCTAAG ATTGTTTTAC CCGGGGAGGC GTTTGATAGT 6 0 
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TTAAAAGAGG CGTTTGATAA AATTGACCCC TATACTTTCT TTTTTCCAAA ATTTGAAGCC 120 

ACTAGCACTT CTATTTCTGA TACTAACACG CAGAGGGTGT TTGAAACGCT CAATAACATT 180 

AAAACAAATC TTATAATGAA ATATAGTAAT GAAAATCCAA ACAATTTCAA CACTTGTCCT 240 

TACAATAATA ATGGTAATAC AAAAAATGAT TGTTGGCAAA ATTTCACCCC ACAAACCGCA 300 

GAAGAATTCA CCAATTTAAT GTTGAACATG ATCGCTGTCT TAGACTCCCA ATCTTGGGGC 360 

GATGCGATCT TAAACGCTCC TTTTGT^TTC ACTAACAGCT CAACAGATTG CGATAGCGAT 420 

CCTTCAAAAT GCGTAAATCC CGGAGTAAAT GGGCGTGTTG ATACTAAAGT CGATCAACAA 480 

TATATACTCA ACAAACAAGG TATTATTAAT AATTTTAGAA AAAAAATAGA AATTGATGCG 540 

GTTGTTTTAA AAAATTCAGG GGTTGTAGGG TTAGCCAATG GATATGGCAA TGATGGTGAA 600 

TATGGCACAT TAGGGGTAGA AGCCTATGCT TTAGATCCTA AAAAACTCTT TGGCAACGAC 660 

CTTAAGACTA TCAATTTAGA AGATTTAAGA ACCATCTTGC ATGAATTCAG CCACACTAAA 720 

GGCTATGGGC ATAACGGGAA TATGACCTAT CAAAGAGTGC CGGTAACGAA AGATGGTCAA 780 

GTGGAAAAGG ATAGTAATGG CAAGCCAAAA GATTCTGATG GCCTCCCCTA TAATGTGTGT 840 

TCGCTTTATG GGGGATCCAA TCAGCCCGCT TTCCCTAGCA ACTACCCTAA TTCCATCTAT 900 

CACAATTGTG CGGATGTCCC GGCTGGCTTT TTAGGGGTAA CAGCAGCGGT TTGGCAGCAG 960 

CTCATCAATC AAAACGCCTT GCCGATCAAC TACGCTAACT TGGGGAGTCA AACAAACTAC 1020 

AACCTAAACG CTAGTTTAAA CACGCAAGAT TTAGCCAATT CCATGCTCAG CACCATCCAA 1080 

AAAACCTTTG TAACTTCTAG CGTTACCAAC CACGATTTTT CAAACGCATC GCAAAGTTTT 1140 

AGAAGCCCTA TTTTAGGGGT TAACGCTAAA ATAGGCTATC AAAACTACTT TAATGATTTC 1200 

ATAGGGTTGG CTTATTATGG CATCATCAAA TACAATTACG CTAAAGCTGT TAATCTUUUU^ 126 0 

GTCCAGCAAT TGAGCTATGG TGGGGGGATA GATTTGTTAT TGGATTTCAT CACCACTTAC 13 20 

TCCAATAAAA ATAGCCCTAC AGGCATTCAA ACCAAAAGGA ATTTTTCTTC ATCTTTTGGT 13 80 

ATCTTTGGGG GGTTAAGGGG CTTGTATAAC AGCTATTATG TGTTGAACAA AGTCAAAGGA 1440 

AGCGGCAATT TAGATGTGGC TACCGGGTTG AACTACCGCT ATAAGCATTC TAAATATTCT 1500 

GTAGGGATTA GCATCCCTTT AATCCAAAGA AAAGCTAGCG TCGTTTCTAG CGGTGGCGAT 1560 

TATACGAACT CTTTTGTTTT CAATGAAGGG GCTAGCCACT TTAAGGTGTT TTTCAATTAC 162 0 

GGTGGGTGTT TT 1632 

(2) INFORMATION FOR SEQ ID NO: 17: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1071 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: mis Cofeature 

(B) LOCATION 1...1071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TTGATGAAAA GCATTTTGCT CTTTATGATT TTTGTAGTTT GTCAGTTAGA AGGCAAAAAA 60 

TTTTCACAAG ATAATTTTAA GGTGGATTAT AACTACTATT TGCGCAAACA GGATTTGCAC 12 0 

ATCATTAAAA CGCAAAACGA TTTGTCCAAT GCCTGGTATC TCCCTCCACA AAAAGCCCCC 18 0 

AAAGAACATT CTTGGGTGGA TTTTGCTAAA AAATATTTAA ACATGATGGA TTATCTAGGC 24 0 
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ACTTATTTTT TGCCTTTTTA TCATAGTTTC ACCCCCATTT TTCAATGGTA CCACCCTAAT 300 

ATCAACCCCT ACCAACGCAA TGAGTTTAAG TTCCAAATCA GTTTTAGAGT GCCTGTATTT 360 

AGGCATATTC TTTGGACTAA AGGCACGCTT TATCTGGCTT ATACCCAAAC TAACTGGTTT 420 

CAAATTTATA ATGACCCTCA ATCCGCCCCC ATGCGAATGA TCAATTTCAT GCCTGAACTC 480 

ATCTATGTTT ATCCTATTAA TTTTAAACCT TTTGGGGGTA AAATAGGGAA TTTTTCTGAA 540 

ATTTGGATAG GTTGGCAGCA CATTTCTAAT GGTGTGGGGG GTGCGCAATG TTACCAGCCT 600 

TTTAATAAAG AAGGTAATCC TGAAAACCAG TTTCCAGGAC AACCTGTAAT CGTTAAAGAT 660 

TATAACGGGC AAAAAGATGT GCGCTGGGGG GGGTGTCKTT CGGTGARCSC GGGCAACSCC 720 

CTGTGTTTCG TTTTGGTGTG GGAAAAGGGA GGCCTAAAAA TCATGGTCGC TTATTGGCCC 780 

TATGTCCCTT ATGATCAATC CAACCCTCAA TTGATTGATT ACATGGGGTA TGGTAACGCT 84 0 

AAAATTGATT ACAGGAGAGG GCGCCACCAT TTTGAATTGC AACTTTATGA TATTTTCACG 900 

CAATACTGGC GTTATGATCG CTGGCATGGA GCTTTCCGCT TAGGCTATAC CTACCGCATT 960 

AACCCTTTTG TGGGGATTTA TGCGCAGTGG TTTAACGGCT ATGGCGATGG CTTGTATGAA 1020 

TACGATGTTT TTTCCAATCG TATAGGGGTA GGAATACGCT TGAACCCTTA A 1071 

(2,) INEORMATIDN F_OR .SEQ .ID NO„-!_18 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 02 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .2028 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

TTGTCTAAAG GTTTGAGTAT CGGTAATAAA ATCATATTGT GCGTGGCGTT GATTGTGATC 60 

GTGTGCGTGA GCATTTTAGG GGTGTCCTTA AACAGCAGGG TGAAAGAGAT TTTAAAAGAA 120 

AGCGCTCTGC ATTCAATGCA AGATAGTTTG CATTTCAAGG TTAAGGAAGT GCAAAGTGTT 180 

TTGGAAAACA CTTATACGAG CATGGGCATT GTCAAAGAAA TGCTCCCTGA AGACACCAAA 240 

AGAGAAATCA AAATCCAGTT GTTAAAAAAC TTCATTTTAG CCAATTCGCA TGTCGCTGGG 300 

GTGAGCATGT TTTTTAAAGA CAGAGAGGAT TTGAGATTGA CGCTTTTACG AGATAACGAT 3 60 

ACGATCAAGT TGATGGAAAA CCCGTCATTA GGGAGTAACC CTTTAGCGCA AAAAGCGATG 420 

AAAAATAAAG AAATTTCTAA AAGCTTGCCT TATTACAGGA AAATGCCTAA CGGGGCGGAA 480 

GTTTATGGCG TGGATATTCT TTTACCACTA TTCAAGGAAA ACACGCAAGA AGTGGTGGGG 540 

GTTCTGATGA TTTTCTTTTC CATTGACAGC TTCAGTAATG AAATCACTAA AAACAGGAGC 600 

GATTTATTTT TAATTGGCGT TAAAGGTAAA GTGCTTTTGA GCGCGAATAA AAGCTTGCAA 660' 

GACAAATCCA TCACCGAAAT TTATAAAAGC GTGCCTAAAG CCACTAATGA AGTGATGGCT 720 

ATTTTAGAAA ATGGCTCTAA AGCGACTTTA GAATACTTGG ATCCCTTTAG CCATAAGGAG 780 

AATTTTTTAG CCGTTGAAAC CTTTAAAATG CTAGGCAAAA CAGAAAGTAA AGACAATCTT 84 0 

AATTGGATGA TCGCTTTGAT CATTGAAAAA GACAAGGTCT ATGAGCAAGT GGGATCGGTG 900 

CGTTTTGTGG TGGTTGCAGC GAGTGCTATC ATGGTGTTAG CCTTAATCAT AGCGATCACT 960 

CTTTTAATGC GAGCGATCGT GAGCAATCGT TTGGAAGTCG TTTCTAGCAC CTTGTCTCAT 102 0 
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TTCTTTAAAT TATTGAACAA TCAAGCCCAT TCTAGCGACA TTAAATTGGT TGAAGCGCGA 1080 

TCTAATGACG AATTAGGGCG CATGCAAACA GCGATCAATA AAAATATCTT GCAAACCCAA 1140 

AAAACCATGC AAGAAGACAG GCAAGCCGTC CAAGACACCA TTAAAGTGGT TTCAGACGTG 1200 

AAAGCGGGGA ATTTTGCGGT GCGCATCACG GCTGAACCCG CAAGCCCTGA TTTGAAAGAA 1260 

TTGAGAGACG CGCTAAATGG GATCATGGAT TATTTGCAAG AAAGCGTAGG GACTCACATG 1320 

CCAAGCATTT TCAAAATCTT TGAAAGCTAT TCTGGCTTGG ATTTTAGAGG GCGGATCCAA 1380 

AACGCTTCGG GTAGGGTGGA ATTGGTTACT AACGCTTTAG GGCAAGAAAT CCAAAAAATG 1440 

CTAGAAACTT CGTCTAATTT TGCCAAAGAT CTAGCGAACG ATAGCGCGAA TTTAAAAGAA 1500 

TGCGTGCAAA ATTTAGAA?^ GGCTTCAAAC TCCCAACACA AAAGCCTGAT GGAAACTTCC 1560 

AAAACGATAG AAAATATCAC CACTTCCATT CAAGGCGTGA GCTCTCAAAG TGAAGCCATG 1620 

ATTGAACAAG GGAAAGACAT TAAAAGCATT GTAGAAATCA TTAGAGATAT TGCCGATCAA 168 0 

ACGAATCTAT TAGCCCTAAA CGCTGCTATT GAAGCCGCAC GAGCCGGCGA GCATGGCAGA 174 0 

GGCTTTGCGG TGGTGGCTGA TGAGGTGAGG AAGCTCGCTG AAAGGACGCA AAAATCCCTC 1800 

AGTGAGATTG AAGCCAATAT TAATATTCTC GTTCAAAGCA TTTCAGACAC GAGCGAAAGC 1860 

ATTAAAAACC AGGTTAAAGA AGTAGAAGAG ATCAACGCTT CTATTGAAGC CTTAAGATCG 1920 

GTTACTGAGG GCAATCTAAA AATCGCTAGC GATTCTTTAG AAATCAGTCA AGAAATTGAC 198 0 

AAAGTCTCTA ACGATATTTT AGAAGATGTG AATAAAAAGC AGTTTTAA 2 028 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .816 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 



ATGAACATAT TCAAGCGTAT TATTTGCGTA 
TTAGACGCCA AACACCACAA AGAAAAAAAA 
GTGGGCGCTA ACCCTGTGCC GCATGCGCAA 
GAGAAAGGGA TCAAATTAGT GATCGTGTCT 
CTCAATGACG GCTCTTTAGA CGCGAATTAC 
AATTTGGACA GAAAAATGCA CCTTGTTGGT 
TTTTATTCTC AAAAAATCAC AGACATTAAA 
CCAAATGATC CGGCCAATCA AGGCAGGGCG 
GCTCTCAAAG ACCCAAGCAA TCTATACGCT 
AACATCAAAA TCAAACCCCT AGAAGCTGCG 
GGGGCTATCA TAACAGGGAA TTATGCCTTG 
GAAGATAAGG ACTCGCCTTA TGCTAATCTT 
GAAGCGATAA AAGCGTTGAT TGAAGCCTTA 
GATACCTATA AGGGGGCGAT TATCCCGGCT 



ACCGCTATTG TTTTAGGTTT TTTTAACCTT 60 

GAAGACCACA AAATCACTCG TGAGCTTAAA 120 

ATCTTGCAAT CAGTTGTGGA TGATTTGAAA 180 

TTTACGGATT ATGTGTTGCC TAATTTAGCG 24 0 

TTCCAGCACC GCCCTTATTT GGATCGGTTT 3 00 

TTGGCCAATA TCCATGTGGA GCCTTTAAGA 360 

AACCTTAAAA AAGGCTCAGT GATTGCTGTG 42 0 

TTGATTTTAC TCCATAAACA AGGCCTTATC 480 

ACGGAGTTTG ATATTGTCAA AAATCCTTAC 540 

TTATTGCCTA AGGTTTTAGG GGATGTGGAT 600 

CAAGCAAAAC TCACCGGAGC CTTATTTTCA 660 

GTAGCCTCTC GTGAGGATAA TGCGCAAGAT 720 

CAGAGCGAAA AGACCAGGAA ATTCATTTTG 780 

TTTTAA 816 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

_( vi ORIGINAL SOURC E: _ 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...486 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGTTTTTTA AAACTTATCA AAAATTACTG GGCGCGAGCT GTTTGGCGCT GTATTTAGTG 60 

GGCTGTGGGA ATGGTGGTGG CGGTGAATCG CCGGTTGAGA TGATTGCAAA TAGCGAGGGT 12 0 

ACGTTTCAAA TCGACTCCAA AGCAGATAGC ATTACTATTC AAGGCGTGAA GCTTAATAGA 18 0 

GGTAATTGTG CTGTCAATTT TGTTCCAGTA AGTGAGACGT TTCAAATGGG TGTTTTAAGT 240 

CAAGTTACTC CAATCTCTAT ACAGGATTTT AAAGATATGG CAAGCACTTA TAAGATATTT 3 00 

GATCAAAAGA AAGGGTTGGC T^AACATAGCA AATAAAATTT CTCAATTAGA GCAAAAGGGT 360 

GTGATGATGG AACCTCAAAC CCTTAATTTT GGAGAAAGTT TAAAAGGCAT TTCTCAAGGG 420 

TGCAATATTA TAGAGGCAGA AATACAAACC GACAAAGGCG CTTGGACTTT TAACTTTGAT 480 

AAATAA 4 86 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .1014 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATGATTAGAT TAAAAGGTTT GAATAAAACT TTAAAAACAA GCTTATTAGC TGGGGTTTTA 60 

CTAGGTGCTA CTGCTCCCTT AATGGCAAAG CCTTTATTAA GCGATGAAGA CTTATTGAAA 120 

CGAGTAAAAC TACACAATAT CAAAGAAGAT ACGCTGACTA GCTGTAATGC TAAGGTGGAC 18 0 

GGCTCTCAAT ACTTGAATAG TGGTTGGAAT TTATCTAAAG AATTTCCGCA AGAATATAGA 24 0 

GAAAAGATTT TTGAATGCGT AGAAGAAGAA AAACATAAAC AAGCCCTTAA TTTAATCAAT 300 

AAAGAAGACA CTGAAGATAA AGAAGAACTT GCAAAAAAAA TCAAAGAAAT TAAAGAAAAA 360 

GCTAAAGTTT TAAGGCAAAA ATTTATGGCT TTTGAAATGA AAGAACACTC TAAAGAATTC 42 0 

CCAAATAAAA AGCAACTTCA AACCATGCTT GAGAACGCTT TTGATAATGG AGCTGAAAGT 48 0 

TTTATTGATG ATTGGCACGA ACGCTTTGGG GGTATAAGTA GAGAGAATAC TTATAAAGCA 540 

CTTGGCATTA AAGAATATAG TGATGAAGGA AAGATATTAG CCTTTGGCGA AAGAAGTTAT 600 

ATTAGACAAT ATAAAAAAGA TTTTGAAGAA AGCACTTATG ATACTAGACA AACCTTATCT 660 

GCTATGGCTA ATATGAGTGG CGAAAACGAT TATAAAATTA CTTGGTTAAA ACCCAAATAT 720 

CAGCTCCATA GTTCAAATAA TATTAAACCC TTAATGTCAA ACACAGAGTT GTTAAATATG 780 

ATAGAGCTAA CCAATATCAA AAAAGAATAT GTTATGGGCT GTAATATGGA AATAGATGGT 840 

TCTAAATATC CCATTCATAA AGATTGGGGA TTTTTTGGTA AGGCAAAAGT CCCAGAAACT 900 

TGGAGAAATA AGATTTGGGA ATGTATTAAG AATAAAGTAA AGTCCTATGA CAACACTACC 960 

GCTGAAATAG GAATAGTTTG GAAAAAAAAT ACTTATTCTA TCTCTCATCA CTAA 1014 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TO POLOG Y : c i r cul ar 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: mis Cofeature 

(B) LOCATION 1...1251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ATGAAAAAAT TAGTTTTTAG CATGCTTTTA TGTTGTAAAA GCGTGTTTGC AGAGGGGGAA 60 

ACTCCTTTGA TTGTCAATGA CCCAGAAACC CATGTAAGTC AAGCCACTAT CATAGGCAAA 120 

ATGGTAGATA GTATCAAAAG ATACGAAGAG ATTATTTCTA AGGCTCAAGC TCAAGTCAAT 180 

CAGTTACAAA AAGTCAATAA CATGATAAAT ACGACTAATT CTTTGATTAG TAGTAGTGCT 240 

ATCACTTTAG CCAATCCTAT GCAAGTTTTA CAAAACGCTC AGTATCAAAT AGAGAGCATT 300 

AGATACAACT ATGAGAATTT AAAGCAAAGC ATAGAAAATT GGAACGCACA AAATTTGTTA 360 

AGAAACAAAT ACTTACAGCA ACAATGCCCT TGGCTTAATG TCAATGCTCT TACTAACAAT 420 

AAGATTGTCA ATCTTAAAGA TCTCAATAAC CTAATCACCA AAAATGGCGA ACAAACCCAA 480 

ACCGCAAGAG ATGTGCAAAA TCTCATTCAG TCCATTAGTG GCAGTGGCTA TGGAAACATG 540 

CAATCACTTG CTGGGGAATT GAGTGGTAGA GCGTGGGGGG AAATGTTGTG TAAAATGGTA 6 00 

AACGATAGTA ATTATGAAAG CGAGCAAGCT CTTTTAGCAA CAGGCAATAA CCCAGAAGAG 660 

CAAAAACGAA GATTTTTGCT TAGAGTAAAG AAAAAGGTTA ATGATAATAA GCAGTTAAAA 720 
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GATAAACTTG ACCCATTTCT AAAAAGACTT GATGTCCTAC AAACTGAGTT TGGTGTAACT 780 

GACCCTACAG CTAACCATAA TAAGCAAGGG ATACATTATT GCACAGAAAA TAAAGAGACA 84 0 

GGTAAATGCG ACCCTATTAA AAATGTATTT AGGACAACTC GCTTAGATAA CGAATTAGAA 900 

CAAGAAATCC AAACGCTCAC ACTTGATTTA ATCAAAGCCT CCAATAAAGA CGCTCAAAGC 96 0 

CAAGCCTACG CAAATTTCAA TCAAAGGATT AAATTACTTA CTCTAAAATA TTTAAAAGAA 1020 

ATTACCAATC AAATGCTCTT TTTAAATCAA ACAATGGCAA TGCAAAGCGA GATTATGACA 1080 

GATGATTATT TTAGGCAi\AA TAATGATGGC TTTGGGGAAA AAGAAAACCA TATAGACAAA 114 0 

CAATTAACGC AAAAAAGAAT AAACGAAAGA GAAAGAGCTA GAATATACTT TCAAAACCCT 1200 

AATGTTAAAT TTGACCAATT TGGCTTTCCC ATTTTTAGTA TATGGGATTA A 12 51 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1131 base pairs 

(B) TYPE: nucleic acid 
-(C) -STRANDEDNESS-: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1131 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GTGAATAAGT GGATTAAAGG GGCGGTTGTT TTTGTAGGGG GTTTTGCAAC GATTACAACC 60 

TTTTCTTTAA TCTACCACCA AAAGCCAAAA GCCCCCCTAA ATAACCAGCC TAGCCTTTTG 12 0 

AATGACGATG AGGTGAAATA CCCCTTACAA GACTACACTT TCACTCAAAA CCCACAGCCA 180 

ACTAACACGG AAAGCTCCAA AGACGCTACC ATCAAAGCCT TACAAGAACA GCTCAAAGCC 24 0 

GCTTTAAAAG CCCTAAACTC CAAAGAAATG AATTATTCCA AAGAAGAGAC TTTTACTAGC 3 00 

CCTCCCATGG ATCCAAAAAC AACCCCCCCT AAAAAAGACT TTTCTCCAAA ACAATTAGAT 360 

TTACTGGCCT CTCGCATCAC CCCTTTCAAG CAAAGCCCTA AAAATTACGA AGAAAACCTG 42 0 

ATTTTCCCTG TGGATAACCC TAATGGCATT GATAGTTTCA CTAACCTTAA AGAAAAAGAC 480 

ATCGCCACTA ATGAAAACAA GCTTTTACGC ACCATTACAG CTGACAAAAT GATACCCGCT 540 

TTTTTGATTA CGCCCATTTC TAGCCAGATC GCTGGTAAAG TGATTGCGCA AGTGGAGAGC 600 

GATATTTTTG CAAGCATGGG CAAAGCCGTC TTAATCCCCA AAGGCTCTAA AGTCATAGGC 660 

TATTACAGCA ACAATAACAA AATGGGCGAA TACCGCTTGG ATATTGTATG GAGTCGAATC 720 

ATCACTCCCC ATGGCATTAA TATCATGCTC ACTAACGCTA AAGGGGCGGA CATTAAAGGC 780 

TATAACGGCT TAGTGGGGGA ATTGATTGAA AGGAATTTCC AACGCTATGG CGTGCCGTTA 84 0 

CTGCTTTCTA CGCTCACTAA CGGCCTATTG ATTGGGATCA CTTCGGCTTT AAACAACAGA 900 

GGCAATAAAG AAGAGGTGAC TAATTTCTTT GGGGATTATC TTTTATTGCA ATTGATGAGG 960 

CAAAGCGGCA TGGGGATCAA TCAAGTGGTC AATCAAATTT TAAGAGACAA GAGCAAGATC 1020 

GCCCCCATTG TGGTGATTAG AGAGGGGAGT AGGGTCTTCA TTTCGCCCAA TACTGACATC 108 0 

TTCTTCCCTA TACCCAGAGA GAATGAAGTC ATCGCTGAGT TTTTGAAGTG A 1131 



(2) INFORMATION FOR SEQ ID NO:24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...2751 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



10 



15 



GTGGATTTGA GGATCCAATC TAAAGAAGTC AGTCATAATT TAAAGGAATT ATCAAAAACG 
CTAATCAGCT ATCCTTTTGA AAAACATGTA GAAGCTTTAG GGGAACAATG CAGTAACTTC 
GTTTCTATTC CCATTAACAA TGACGACTAT TCAAATATTT GCACTTTTGT GAGTGATTTT 
25 ATAAATCTTA TAGCTTCTTA CAATTTATTA GAATCATTTT TAGATTTTTA TAAAGATAAA 
TTAAAATTGA GCGAGCTTGT AACTGAATAT GCCAACGTAA CCAATAATCT GCTTTTCAAA 
AAATTAATCA AACATTTAAG CGGCAACAAT CAATTGGTTA AAAATTTTTA TCAGTGTATA 
AGAGAAATTA TAAAATACAA CGCCCCTAAT AAAGAATACA AACCCAATCA ATTTTTTATA 
ATAGGGAAAG GCAAACAAAA ACAATTAGCA AAAATTTATT CTCATTTAAA AGAACTTAGT 
30 GCAAGTGAAA TTAAACCACA AGATATGGAA GACATCTTAA AAAAGCTAGA GGAATTAGAT 
AAAATTTTTA AAACTACCGA CTTTACAAAA TTCACACCAA AAACTGAAAT TAAGGATATT 
ATTAAAGAAA TAGACGAAAA ATACCCTATC AATGAAAATT TTAAACGGCA ATTTAATGAG 
TTTGAATCAA ATATTGAAAA ACATGATGAA ATAAAAAAGG ATTTTGAGCG AAACAAAGAG 
TCGCTGATCC GAGAAATTGA AAATCACTGC AAAAATGAAT GCAATAGCGA AGAAGAGCCG 
35 GAGTATAAGA TTAATGATCT GCTCAAAAAT ATCCAACAAA TATGCAAAAA TTATATAGAA 
AGTCATGCCG TTAATGATGT GTCTAAAGAT ATTAAATCCA TGATGTGTCA GTTTTATTTG 
AAACAGATAG ATTTATTAGT CAATTCAGAA ATTGTGCGAT ACAGATACAG CAATCTTTTT 
GAACCAATAC AAAGATCTTT ATGGGAGAGT ATAAAAATTT TAGATAATGA AAGTGGCATT 
TATTTGTTCC CTAAAAATAT TGGTGAAATC AAGGATAAAT TTGAAGCAAA CAAGGAAAAA 
40 TTCAAACAAA GCAAAAATGT TTCTGAGTTC GCAGAATATT GCCGAGAGTG TAACCCCTAT 
ACAGCGTTTA ACTTTCATCT AAATATAAAT AATGGTTTAT CTCATCAATT TGAAAAATTC 
GTGCCAATCA TGAAAGAATA CAAAGAGCCA AAAATCACAG ATAATGACCT TGAAGCCATA 
TCAACCAAAG AGACTGGTCT TGCTAGCCAA TTATCTGGGC ACTGGTTTTT TCAGCTTTCG 
TTATTTAATA AAACAAACTT TAATCCTAAT AAAATTTGGA TTCCTTTAGA GTTCAATAAA 
45 AGATCAAAAA TAAAGTTTGA TAAAGATTTA GAAATCTATT TTGATAGTCA TGAATCGTTC 
AATATCTCTA AAAAATACTT GCAAGAAATA GATCAAGAAT CACTAAAAAA GATCAAACAA 
TCAAAAGATT TTTTTTCAAT TCAAAAAATA GAGAGTAAGC ATGATAATAA CGATATACTG 
CAACTTGAAT TTTTTGAGAA TGATACAAGT TTTCTTTTTG CTAAAGGAAG TTTTGCAGAA 
ATTTTAGAAT ACAACATGCA ATTAAAAATA GATTCTTTAA TTACAAAAGA ATTTAATAAG 
50 CTTTTAGCGA TCGTTCAAGA TAGTCCCCAA GATAGTTACC AATTAAAAAT TCGTGTCCGA 
CATAACAATA AGCTTCCTAG AGAGAAATAT ACGGAACATG AAATAAAACT TGAAGTTTAT 
GATTGCAGAA AATCCCACGA TCACAATGAG CCAATCATCT TAAGCCAGCA AAGCACCGGC 
TTCCAATGGG CGTTTAATTT CATGTTTGGC TTTCTTTATA ATGTGGGATC ACATTTTAGT 1920 
TTTAACCATA ATATTATCTA TGTCATGGAC GAGCCAGCCA CTCATTTGAG CGTGCCAGCC ^ 
55 AGAAAGGAGT TTAGGAAATT TTTAAAAGAA TACGCTCATA AAAATCATGT TACTTTTGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 8 0 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



1980 
2040 
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TTAGCCACCC ATGACCCCTT TTTAGTGGAT ACGGATCATT TAGATGAAAT TLAGGATTGTG 2100 

GAAAAGGAAA CAGAAGGCTC TGTAATTAAG AATCACTTTA ACTATCCCCT AAATAATGCA 2160 

AGCAAAGACT CCGACGCTTT GGACAAAATC AAACGCTCTT TAGGAGTGGG CCAGCATGTT 2220 

TTTCATAACC CCCAAAAACA CCGAATCATT TTTGTAGAAG GCATCACGGA TTATTGTTAT 22 80 

TTGAGCGCTT TTAAATTGTA TTTGCGTTAC AAAGAATACA AGGAC/^CCC CATTCCTTTC 2340 

ACTTTCTTAC CCATTTCAGG GCTTAAAAAC GATTCAAACG ATATGAAAGA AACCATTGAA 24 00 

AAACTTTGCG AGTTAGACAA TCACCCTATT GTTTTGACAG ACGATGACAG AAAATGCGTT 2460 

TTTAACCAAC AAGCAACGAG CGAACGATTT AAAAGAGCTA ATGAAGAAAT GCATGATCCC 2520 

ATCACCATCC TACAACTCTC AGACTGCGAT AGGCATTTCA AACAAATTGA AGATTGTTTC 2 580 

AGCGCAAACG ATAGAAACAA ATACGCTAAA AATAAGCAAA TGGAATTGAG CATGGCTTTT 2640 

AAAACAAGGC TTTTGTATGG CGGAGAAGAT GCGATAGAAA AACAAACAAA AAGAAATTTT 2700 

TTAAAATTAT TCAAATGGAT TGCATGGGCT ACAAACTTGA TCAAAAACTA A 2751 

(2) INFORMATION FOR SEQ ID NO: 25: 

{"i-) — S E QUENGE — GHARAGT-E R-X S T-I G S 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATGACTGCAA TGATGCGTTA TTTTCACATC TATGCGACCA CTTTTTTCTT CCCTTTGGCG 60 

CTTCTTTTTG CGGTTAGTGG GCTTTCATTG CTCTTTAAAG CGCGCCAAGA CACTGGCGCT 120 

AAGATCAAAG AATGGGTTTT AGAAAAATCC TTAAAAAAAG AAGAACGATT GGACTTTTTA 180 

AAAGGCTTTA TAAAAGAAAA CCATATCGCT ATGCCTAAAA AGATAGAGCC TAGAGAGTAT 240 

AGGGGAGCGT TAGTCATTGG CACGCCTTTG TATGAAATCA ACCTTGAAAC TAAAGGCACT 3 00 

CAAACGAAAA TCAAGACCAT TGAAAGGGGC TTTTTAGGCG CGCTCATCAT GCTGCATAAG 3 60 

GCTAAGGTGG GCATCGTGTT TCAGGCGCTT TTAGGGATTT TTTGCGTGTT TTTATTGTTG 42 0 

TTTTACTTGA GCGCGTTTTT AATGGTGGCT TTTAAAGACA CTAAACGCAT GTTTATAAGC 4 80 

GTTTTAATAG GGAGCGTGGT GTTCTTTGGA GCGATCTATT GGTCTTTGTA G 531 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQtTENCE CHARACTERISTICS: 

(A) LENGTH: 66 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAIi SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATtJRE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...669 



(xi) SEQUENCE DESCRIPTION: SI 

ATGTTTAAAA ACGCTTTAAA TATACAAGAT 
ATTATTGGCA CAT^TGGTGC TGGAAAATCA 
TCAGACTATA ATTTTAAAGC ACAAAACAAT 
CAACGCAAGC AATTGGGAGT TGTCTCTAAC 
AACGACCTTT TTAAATTCTA TCAATTTTTT 
AAAAATCTTT TAAATAAAAC CTACGAACAC 
ATTGACTTAG CTCTTAGCCA TCACCCACAA 
TTAGAGCAAA ACGCTCTTAT AAGACTATCA 
CTTACAAGTA TCATCGCCAC TCATGATCCT 
CTCCTTAAGA ATGGCAACAT TGCTCAATAC 
GCTAAAACTT TTAACTTTAA AGAAAAACCA 
GATATTTAA 



Q ID NO : 2 6 : 

TTTTCATTTA AAAATCATAC TAGTACAGCC 6 0 

ACGCTTATCA ACACTATTCT AGGCATTAGA 12 0 

AATATTCCAT ACCACGACAA TGTTATACCA 18 0 

CTATTCAACT ACCCACCTGG ATTAAACGCA 24 0 

CACAAAAACT GCACTCTAGA TTTGTTTGAA 3 00 

CTAAGCGACG GACAAAAACA GCGCTTAAAA 360 

TTAGTTATTA TGGATGAACC AGAAACCAGT 420 

AATCTCATAA GCTTGCGCAA CACCCAACAA 4 80 

ATTGTCTTAG ATAGTTGCGA ATGGGTATTG 540 

AAACCTTTAA ATTCTATATT AAAATCTGTA 6 00 

ACCACAAAAG ACTTATTAGC GTTACTAAAG 660 

669 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

ATGTATGCGG CTCATCCTAT TAAACCCATA AAAGCCCCTA AACTCAAATC TCAATTTTTA 

AGGCGTGTGT TTGTGGGCGC GTCCATTAGG CGCTGGAATG ACCAAGCATG CCCTTTGGAA 

TTTGTGGAAT TAGACAAGCA AGCCCATAAA GCGATGATTG CGTATCTGCT CGCTAAAGAT 

TTAAAAGATA GGGGTAAAGA TTTAGATTTA GATCTTTTAA TCAAATATTT TTGCTTTGAG 

TTTTTGGAGC GCTTGGTTTT AACCGATATT AAACCCCCTA TTTTTTACGC CCTCCAACAA 



001 aOOQA 1 I ^ 
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ACGCATAGTA AAGAGTTAGC TTCCTATGTT GCGCAAAGTT TGCAAGATGA AATCAGTGCG 360 

TATTTTTCTT TAGAGGAACT CAAAGAGTAT TTAAGCCACA GGCCTCAAAT TTTAGAAACT 420 

CAAATTTTAG AGAGCGCGCA TTTTTATGCG TCTAAGTGGG AGTTTGATAT TATCTATCAT 480 

TTTAACCCCA ACATGTATGG CGTGAAAGAG ATTAAAGATA AAATTGACAA GCAACTCCAC 540 

AATAACGATC ATTTGTTTGA AGGGCTTTTT GGGGAAAAAG AAGATTTGAA AAAATTGGTG 600 

AGCATGTTTG GGCAGTTGCG TTTCCAAAAG CGCTGGAGCC AAACCCCAAG AGTGCCACAA 660 

ACCAGTGTTC TAGGGCATAC TTTATGCGTG GCGATTATGG GGTATTTATT GAGTTTTGAC 72 0 

TTGAAAGCTT GTAAAAGCAT GCGGATCAAT CATTTTTTGG GCGGGCTTTT CCATGATTTA 780 

CCCGAAATTT TAACCCGAGA CATTATCACG CCCATCAAAC AAAGCGTTGC AGGGCTTGAT 84 0 

CATTGCATTA AAGAGATTGA AAAAAAGGAA ATGCAAAACA AAGTCTATTC CTTTGTGTCT 900 

TTGGGCGTTC AAGAAGATTT GAAATATTTC ACCGAAAACG AGTTTAAAAA CCGCTACAAA 960 

GACAAGTCTC ATCAAATCGT TTTCACTAAA GACGCTGAAG AATTATTCAC GCTTTATAAT 1020 

AGCGATGAAT ATCTTGGGGT TTGCGGGGAG CTTTTGAAGG TGTGCGATCA TTTGAGCGCG 1080 

TTTTTAGAAG CCCAAATCTC TCTTTCTCAT GGCATTTCTA GCTACGATTT AATCCAAGGA 1140 

GCTAAAAACC TTTTAGAATT GCGATCCCAA ACGGAACTGC TTGATTTGGA TTTAGGGAAA 1200 

--^TGtFTTAGAG-AT-T^AAGTA— A ■ -1221^ 

(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1008 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



GTGTTGTGGG TGCTATATTT TTTAACCAGT 
TCTAAAAAAT CCATGCTCTT TGTGGATAAC 
AGAACCCCAC GAGCCGGGGG GCTTGGGATC 
GAACCTTTTG AGATGCCTTT TAAGGGGCCT 
TTGAGCGGTT TTTTAGAAGA CATTAACCTT 
CAAGCTGTAG GGGTCGTTTG CATCATTTCA 
CCCCTTTTTA GCTTGCCTTA TTTCATCGCT 
ATCAGTAACG CTATTAATAT CATTGACGGG 
ATCGCGCTTT TAGTCATTCA TTATATAGAC 
ATGGTGCTTG GGTTTATGGT GTTAAATTTC 
GGGGCGTATT TTTTGGGTTT GGTGTGCGGG 
AAAATCAGCG TGTTTTTTGG GCTCAATTTA 
AGTATCCTTA GGCGCAAAAT AAAACGCCAG 
CACACCCTTT TATTTAAATT CTTGCAACAA 
GCGTTTATCC TTATTCTATG CAACCTGCCT 



TTATTTATTT GCTCTTTGAT TGTTTTGTGG 60 

GCTAATAAAA TCCAAGGCTT CCATCATGCA 120 

TTTCTTTCTT TTGCGTTGGC TTGTTATCTT 18 0 

TTTGTTTTCT TAGGGCTATC GCTAGTGTTT 24 0 

TCATTAAGCC CCAAAATACG CCTTATTTTG 300 

TCAACGCCTT TAGTGGTGAG CGATTTTTCG 360 

TTTTTATTCG CTATTTTTAT GCTGGTGGGT 420 

TTTAACGGGC TTGCATCTGG GATTTGCGCG 480 

CCTAGCAGTT TGTCTTGTTT GCTCGCTTAC 540 

CCTTCAGGAA AGATTTTTTT AGGCGATGGG 600 

ATTTCTCTCT TGCATTTGAG TTTGGAGCAA 660 

ATGCTTTATC CGGTCATAGA GGTGCTTTTT 720 

AAAGCCACCA TGCCGGATAA TTTGCATTTG 780 

CGCTCTTTCA ATTACCCTAA CCCTTTATGC 840 

TTTATTTTAA TAAGCGTTTT GTTTCGCTTG 900 
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GACGCTTATG CGCTCATTGT GATTAGCCTA GTCTTTATCG CATGCTATTT AATAGGCTAT 960 
GCTTATTTGA ATAGGCAAGT TTGCGCTTTA GAAAAGCGGG CGTTTTAA 1008 

(2) INFORMATION FOR SEQ ID NO: 29: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGAAAAAGG TTATTGTGGC TTTAGGCGTT TTGGCGTTCG CAAATGTTTT AATGGCAACC 6 0 

GATGTTAAGG CTCTTGTAAA AGGTTGTGCC GCTTGCCATG GGGTTAAGTT TGAAAAGAAA 12 0 

GCTTTAGGTA AAAGCAAAAT CGTTAACATG ATGAGCGAAA AAGAGATTGA AGAGGATCTT 180 

ATGGCTTTTA AAAGCGGTGC CAACAAGAAT CCTGTCATGA CCGCGCAAGC TAAAAAATTA 24 0 

AGCGATGAAG ACATCAAAGC TTTAGCCAAA TACATCCCCA CTCTCAAATA A 2 91 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...471 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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ATGCGAGATT TCAATAACAT TCAAATCACA CGCTTAAAAG TGCGTCAAAA TGCCGTTTTT 60 

GAAAAACTGG ATCTGGAGTT TAAAGATGGC TTGAGCGCGA TTAGTGGGGC TAGTGGGGTG 120 

GGGAAAAGCG TCCTTATTGC GAGCCTTTTA GGGGCGTTTG GGCTTAAAGA GAGCAACGCT 180 

TCAAACATTG AAGTGGAATT GATCGCGCCT TTTTTAGACA CGGAAGAATA CGGCATTTTT 240 

AGAGAAGATG AGCATGAACC CTTAGTTATT AGCGTGATTA AAAAAGAAAA AACACGCTAT 300 

TTTTTAAACC AAACAAGCCT ATCTAAAAAC ACGCTCAAAG CGTTATTAAA GGGGCTTATT 360 

AAACGCTTAT CTAACGACAG ATTCAGCCAG AATGAACTCA ACGATATTTT AATGCTCTCC 420 

TTATTAGATG GCTATATCCA AAATAAAAAT AGGCGTTTAG CCCCCTTTTA G 471 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY-;- - circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...357 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GTGATGCTAA TGGCAATTTT TACCCCTTAT ATTCTTATTT TGAAAATGAT GAAAAAGTCT 6 0 

ATGAGTTTAT TCGCCAATAT GGGGTTGGAG CAAATTTTTT GCAACAGAGA CATTAAAGAT 12 0 

TTAAATGATT TTGTTTTTGG TATAGAAGTG GGGCTTGATA GCAATGCGAG AAAAAATCGT 18 0 

AGCAGAAAGG CTATGGAAAA TCATCTTATC GGTCTTTTTG TCCAAGCTCA ATTAAATTTT 240 

AAAGAACAAG TAGATATTAG AGAATTTGAG GATTTACGCC AGGCTTTTGG AAATGATACT 3 00 

AAAAAATTTG ATTTTGTTAT TTTTAGCAAA GAGAAAACTT ATTTTCATAG AAGCTAA 357 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM.: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...1068 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATGAATATCA AAATTTTAAA AATATTAGTT GGAGGGTTAT TTTTTTTGAG CTTGAACGCC 60 

CATTTATGGG GGAAACAAGA CAATAGCTTT TTAGGGATTG GTGAAAGAGC CTATAAAAGC 12 0 

GGGAATTATT CTAAAGCGGC GTCTTATTTT AAAAAAGCAT GCAACGATGG GGTGAGTGAA 18 0 

GGCTGCACGC AATTAGGAAT CATTTATGAA AACGGGCAAG GCACTAGAAT AGATTATAAA 240 

AAAGCCCTAG AATATTATAA AACCGCATGC CAGGCTGATG ATAGGGAAGG GTGTTTTGGC 300 

TTAGGGGGGC TTTATGATGA GGGTTTAGGC ACGGCTCAAA ATTATCAAGA AGCCATTGAC 360 

GCTTACGCTA AGGCATGCGT TTTAAAACAC CCTGAGAGTT GCTACAATTT AGGCATCATT 42 0 

TATGATAGAA AAATCAAAGG CAATGCCGCT CAAGCGGTTA CTTACTATCA AAAAAGCTGT 480 

AATTTTGATA TGGCTAAGGG GTGTTATATT TTAGGCACTG CCTATGAAAA AGGCTTTTTA 54 0 

GAAGTCAAAC AGAGCAACCA TAAAGCCGTT ATCTATTATT TGAAAGCGTG CCGATTGAAT 600 

GAGGGGCAGG CTTGCCGAGC GTTAGGGAGT TTGTTTGAAA ATGGCGATGC AGGGCTTGAT 660 

GAAGATTTTG AAGTGGCGTT TGATTATTTG CAAAAAGCTT GCGCTTTAAA CAATTCTGGT 720 

GGTTGCGCGA GTTTAGGCTC TATGTATATG TTGGGCAGGT ATGTTAAAAA AGACCCCCAA 78 0 

AAGGCTTTTA ACTATTTCAA GCAAGCATGC GATATGGGGA GCGCGGTGAG TTGCTCTAGG 840 

ATGGGCTTTA TGTATTCGCA AGGGGACACT GTTTCAAAAG ACTTGAGGAA AGCCCTTGAT 900 

AATTATGAAA GAGGTTGCGA TATGGGCGAT GAAGTGGGTT GCTTCGCTCT AGCGGGCATG 960 

TATTACAACA TGAAAGATAA AGAAAACGCC ATAATGATTT ATGACAAGGG CTGTAAATTG 1020 

GGCATGAAAC AGGCATGCGA AAATCTCACC AAACTCAGGG GGTATTAG 1068 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE; DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 . . . 582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

ATGAAAGAAA AAAACTTTTG GCCTTTAGGA ATCATGAGCG TGCTTATTTT TGGGCTTGGG 6 0 

ATCGTGGTGT TTTTAGTGGT GTTTGCCCTA AAAAATTCGC CTAAAAATGA TTTAGTGTAT 120 

TTCAAGGGTC ATAACGAAGT GGATTTAAAC TTTAACGCCA TGCTTAAAAC TTATGAAAAC 180 

TTTAAATCCA ATTATCGTTT TTCAGTGGGT TTAAAGCCTC TTACCGAAAG CCCTAAAACC 240 

CCCATTTTGC CCTATTTTTC TAAAGGCACG CATGGGGATA AAAAAATCCA AGAAAACCTT 3 00 

TTAAACAACG CTTTGATTTT AGAAAAGTCC AACACGCTTT ATGCACAATT GCAACCGCTC 36 0 
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AAACCCGCTT TAGATTCGCC AAATATTCAA GTGTATTTAG CGTTCTATCC CAGCCAATCC 420 

CAGCCCAGAT TATTAGGAAC GCTTGATTGT AAAAACGCAT GCGAACCTTT AAAATTTGAT 48 0 

TTGTTAGAGG GCGATAAAGT GGGGCGCTAT AAGATCCTTT TTAAATTTGT TTTTAAAAAT 540 

AAAGAAGAAT TGATTTTGGA GCAACTGGCT TTTTTTAAGT AG 582 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TO POLOGY : c i r cul a r 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( iv) ANTI - SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: Tnisc_f eature 

(B) LOCATION 1...8 70 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

TTGGGTATCA ATATGTGTTC TAAAAAAATA AGAAATCTCA TTTTATGCTT TGGTTTTATT 60 

TTAAGCTTGT GCGCTGAAGA AAATATCACC AAAGAAAACA TGACTGAAAC GAACACGACT 12 0 

GAAGAAAACA CCCCTAAAGA CGCTCCCATT CTTTTGGAAG AAAAACGCGC CCAAACTCTA 180 

GAGCTTAAAG AAGAAAATGA AGTGGCAAAA AAGATTGATG AAAAAAGCCT GCTTGAAGAA 240 

ATCCATAAGA AAAAACGCCA GCTTTACATG CTCAAAGGGG AATTGCATGA AAAGAATGAA 3 00 

TCCATCTTAT TCCAACAAAT GGCTAAAAAT AAGAGCGGCT TTTTTATAGG CGTGATCCTT 360 

GGCGATATAG GGATTAACGC TAATCCTTAT GAGAAGTTTG AACTTTTAAG CAATATTCAA 420 

GCTTCTCCCT TGCTGTATGG TTTAAGGAGC GGGTATCAAA AGTATTTCGC TAACGGGATT 4 80 

AGCGCCTTAC GCTTTTATGG GGAATATTTA GGGGGGGCGA TGAAAGGGTT TAAAAGCGAT 54 0 

TCTTTAGCTT CTTATCAAAC CGCAAGCTTG AATATTGATC TGTTGATGGA TAAGCCTATT 6 00 

GACAAAGAAA AAAGGTTTGC GTTAGGGATA TTTGGAGGCG TTGGAGTGGG GTGGAATGGG 66 0 

ATGTATCAAA ATTTAAAAGA GATTAGAGGG TATTCACAGC CTAACGCCTT TGGGTTGGTG 720 

TTAAATTTAG GGGTGAGCAT GACGCTCAAC CTCAAACACC GCTTTGAATT AGCCCTAAAA 780 

ATGCCTCCCT TAAAAGAAAC TTCGCAAACC TTTTTATATT ATTTTAAAAG CACTAATATT 840 

TATTATATTA GTTACAACTA TTTATTGTAA 8 70 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 007 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 



10 



(vi) ORIGINALi SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...2007 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 



60 
120 



ATGAGAAAAC TATTCATCCC ACTTTTATTA TTCAGCGCTT TAGAAGCGAA CGAGAAAAAC 
GGCTTTTTCA TAGAAGCCGG CTTTGAAACT GGGCTATTAG AAGGCACACA AACGCAAGAA 

15 AAAAGACACA CCACCACAAA AAACACTTAC GCAACTTACA ATTATTTACC CACAGACACG 180 
ATTTTAAAAA GAGCGGCTAA TTTATTCACC AATGCCGAAG CGATTTCAAA ATTAAAATTC 
TCATCTTTAT CCCCTGTTAG AGTGTTGTAT ATGTATAATG GTCAATTAAC TATAGAAAAC 
TTCTTGCCTT ATAATTTAAA TAATGTTAAG CTTAGTTTTA CAGACGCTCA AGGCAACACG 
ATTGATCTAG GCGTGATAGA GACCATCCCC AAACACTCTA AGATTGTTTT ACCCGGGGAG 

20 GCGTTTGATA GTTTAAAAGA GGCGTTTGAT AAAATTGACC CCTATACTTT ATTTCTTCCA 
AAATTTGAAG CCACTAGCAC TTCTATTTCT GATACTAACA CGCAGAGGGT GTTTGAAACG 
CTCAATAACA TTAAAACAAA TCTTATAATG AAATATAGTA ATGAAAATCC AAACAATTTC 
AACACTTGTC CTTACAATAA TAATGGTAAT ACAAAAAATG ATTGTTGGCA AAATTTCACC 
CCACAAACCG CAGAAGAATT CACCAATTTA ATGTTGAACA TGATCGCTGT CTTAGACTCC 

25 CAATCTTGGG GCGATGCGAT CTTAAACGCT CCTTTTGAAT TCACTAACAG CTCAACAGAT 
TGCGATAGCG ATCCTTCAAA ATGCGTAAAT CCCGGAGTAA ATGGGCGTGT TGATACTAAA 
GTCGATCAAC AATATATACT CAACAAACAA GGTATTATTA ATAATTTTAG AAAAAAAATA 900 
GAAATTGATG CGGTTGTTTT AAAAAATTCA GGGGTTGTAG GGTTAGCCAA TGGATATGGC 960 
AATGATGGTG AATATGGCAC ATTAGGGGTA GAAGCCTATG CTTTAGATCC TAAAAAACTC 1020 

30 TTTGGCAACG ACCTTAAGAC TATCAATTTA GAAGATTTAA GAACCATCTT GCATGAATTC ^ '^'^'^ 
AGCCACACTA AAGGCTATGG GCATAACGGG AATATGACCT ATCAAAGAGT GCCGGTAACG 
AAAGATGGTC AAGTGGAAAA GGATAGTAAT GGCAAGCCAA AAGATTCTGA TGGCCTCCCC 
TATAATGTGT GTTCGCTTTA TGGGGGATCC AATCAGCCCG CTTTCCCTAG CAACTACCCT 
AATTCCATCT ATCACAATTG TGCGGATGTC CCGGCTGGCT TTTTAGGGGT AACAGCAGCG 

35 GTTTGGCAGC AGCTCATCAA TCAAAACGCC TTGCCGATCA ACTACGCTAA CTTGGGGAGT 
CAAACAAACT ACAACCTAAA CGCTAGTTTA AACACGCAAG ATTTAGCCAA TTCCATGCTC 
AGCACCATCC AAAAAACCTT TGTAACTTCT AGCGTTACCA ACCACCATTT TTCAAACGCA 
TCGCAAAGTT TTAGAAGCCC TATTTTAGGG GTTAACGCTA AAATAGGCTA TCAAAACTAC 
TTTAATGATT TCATAGGGTT GGCTTATTAT GGCATCATCA AATACAATTA CGCTAAAGCT 

40 GTTAATCAAA AAGTCCAGCA ATTGAGCTAT GGTGGGGGGA TAGATTTGTT ATTGGATTTC 

ATCACCACTT ACTCCAATAA AAATAGCCCT ACAGGCATTC AAACCAAAAG GAATTTTTCT 1740 
TCATCTTTTG GTATCTTTGG GGGGTTAAGG GGCTTGTATA ACAGCTATTA TGTGTTGAAC 
AAAGTCAAAG GAAGCGGCAA TTTAGATGTG GCTACCGGGT TGAACTACCG CTATAAGCAT 
TCTAAATATT CTGTAGGGAT TAGCATCCCT TTAATCCAAA GAAAAGCTAG CGTCGTTTCT 

45 AGCGGTGGCG ATTATACGAA CTCTTTTGTT TTCAATGAAG GGGCTAGCCA CTTTAAGGTG 
TTTTTCAATT ACGGGTGGGT GTTTTAG 

(2) INFORMATION FOR SEQ ID NO: 36: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

55 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



1800 
1860 
1920 
1980 
2007 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .192 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

_ ATGAATACAG„ AAAT.T.XTAAC„CATCAT.GT.TA_GXT.GTCTCCG— TQ CTTATQGG- AT-TGGTAGGG- - -60- 

TTAATAGCGT TTTTATGGGG GGTTAAAAGC GGTCAGTTTG ACGATGAAAA ACGCATGCTT 120 

GAAAGCGTGT TGTATGACAG CGCGAGCGAC TTGAACGAAG CGATTTTACA AGAAAAACGC 18 0 

CAAAAGAATT AA 192 

(2) INFORMATION FOR SEQ ID NO : 3 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

( ii ) MOLECULE TYPE : DNA (genomic ) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 7 : - 

ATGGTATTTT TTCATAAGAA AATTATTTTA AATTTTATCT ATTCTTTAAT GGTTGCTTTT 6 0 

TTATTCCATT TATCCTATGG GGTTCTTTTA AAAGCCGATG GAATGGCTAA AAAGCAAACT 120 

CTTTTAGTGG GTGAAAGGCT TGTGTGGGAT AAGCTCACGC TGTTAGGGTT TTTAGAAAAA 180 

AACCATATCC CCCAAAAACT CTACTACAAT TTGAGCTCTC AAGATAAAGA ATTGAGTGCT 2 40 

GAAATCCAAA GCAATGTTAC CTACTACACT TTAAGAGATG CAAATAACAC GCTCATTCAA 3 00 

GCCCTTATCC CTATTAGCCA GGATTTGCAA ATCCATATTT ACAAAAAAGG AGAGGATTAT 3 60 

TTTTTAGACT TTATCCCCAT TGTTTTCACT CGTAAAGAAA GAACCCTCCT TCTTTCCTTA 4 20 

CAAACTTCGC CCTATCAAGA TATTGTCAAA GCCACCAATG ACCCCCTTTT AGCCAACCAA 4 80 

TTGATGAACG CGTATAAAAA AAGCGTGCCT TTTAAACGCC TAGTGAAAAA CGATAAAATC 540 

GCTATCGTTT ATACAAGGGA TTATCGTGTG GGGCAAGCGT TTGGCCAGCC GACCATCAAA 600 

ATGGCGATGG TTAGCTCTCG TTTGCACCAA TACTATCTTT TTTCCCATTC AAACGGGCGT 660 

TATTACGATT CAAAAGCGCA AGAAGTGGCA GGGTTTTTAC TAGAAACCCC GGTGAAATAC 72 0 
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ACCCGCATTT CTTCGCCTTT TTCGTATGGG AGGTTCCATC CTGTTTTAAA AGTTAAACGG 780 

CCTCATTACG GCGTGGATTA TGCGGCTAAA CATGGCAGTT TGATCCATTC TGCTTCAGAC 84 0 

GGCCGTGTGG GTTTTATAGG GGTTAAGGCG GGTTATGGGA AGGTGGTTGA AATCCATTTG 900 

AATGAATTGC GCTTGGTGTA TGCTCACATG AGCGCGTTCG CTAACGGATT AAAAAAAGGC 960 

TCGTTCGTTA AAAAAGGGCA AATCATAGGA AGAGTGGGAA GCACGGGTTT AAGCACCGGG 1020 

CCGCATTTGC ATTTTGGCGT GTATAAAAAC TCCCGCCCCA TTAATCCTTT AGGCTATATC 1080 

CGCACCGCTA AAAGCAAGCT GCATGGCAAA CAAAGAGAGG TTTTTTTAGA AAAAGCTCAG 114 0 

TATTCTAAGC AAAAATTAGA AGAACTTTTT AAAACCCATT CTTTTGAAAA AAATTCATTT 1200 

TATCTTTTAG AGGGTTTTTA A 1221 

(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 91 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : do\ible 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
30 (B) LOCATION 1...891 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TTGTTTTTAG TCAAAAAAAT AGGCGTGGTA ATAATGATTT TAGTCTGCTT TTTAGCTTGC 6 0 

35 TCGCAAGAGA GCTTTATCAA AATGCAAAAA AAAGCCCAAG AGCAAGAAAA TGACGGCTCT 12 0 

AAACGCCCCA GCTATGTGGA TTCGGATTAT GAAGTCTTTA GCGAAACGAT TTTTTTACAA 180 

AACATGGTGT ATCAGCCTAT AGAGGAAAGA AACGCTTTTT TCCAACTGAC TAAAGATGAA 24 0 

GACAATTCTT TTAACCCTGA AAATTCCGTG ATTTTACTGA ATGAGCCAAG CGATAATAGT 300 

GAAAAAAACC TACTCTCATA CCCAAACGAT CCCAATAACA ATGAAGACAA CGCTAATAAT 360 

40 AGTCAAAAAA ATCCGTTCCT TTACAAGCCC AAAAGAAAAA CAAAAAACCC AAAACTCATT 420 

GAATATTCCC AACAAGATTT CTACCCCCTA AAAAATGGGG ATATTATCAT GAGTAAAGAA 48 0 

GGGGATCAAT GGTTGATAGA AATCCAATCC AAAGCCTTGA AGCGTTTTTT AAAAGATCAA 54 0 

AACGATAAAG ATCGCCAGAT CCAAACTTTC ACTTTTAATG ACACTAAAAC GCAAATCGCG 600 

CAAATTAAGG GCAAAATTTC TTCGTATGTT TATACCACCA ATAACGGTAG CTTGAGTTTA 660 

45 AGGCCTTTTT ATGAATCGTT TTTGTTAGAA AAAAAGAGCG ATAATGTTTA TACGATAGAG 720 

AATAAGGCTT TAGATACTAT GGAGATTTCA AAGTGTCAAA TGGTGTTAAA AAAGCATTCA 780 

ACCGATAAAT TAGACAGCCA GCATAAAGCC ATCAGTATTG ATTTGGATTT TAAAAAAGAG 84 0 

CGCTTTAAGA GCGATACGGA ACTCTTTTTA GAATGTCTTA AGGAAAGTTA G 8 91 

50 (2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 7 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : double 
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(D) TOPOLOGY: circular 
(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...747 

J X i ). ._S EO0ENCE„DES CR I P_TTON.: _„ S EO_I D_NO.:3.9-:- - 

GTGAGCTATG ACAACACCGA TGATTATTAT TTCCCTAGAA ATGGGGTTAT CTTTAGTTCC 6 0 

TATGCGACAA TGTCTGGTTT GCCAAGCTCT GGCACGCTCA ATTCTTGGAA CGGGTTAGGC 12 0 

GGGAATGTCC GTAACACCAA AGTTTATGGT AAATTCGCCG CTTACCACCA TTTGCAAAAA 18 0 

TATTTATTGA TAGATTTGAT CGCTCGTTTT AAAACGCAAG GGGGCTATAT CTTTAGGTAT 240 

AACACCGATG ATTACTTGCC CTTAAACTCC ACTTTCTACA TGGGGGGCGT AACCACGGTG 300 

AGAGGCTTTA GGAACGGCTC AATCACACCT AAAGATGAGT TTGGCTTGTG GCTTGGAGGC 36 0 

GATGGGATTT TTACCGCTTC TACTGAATTG AGCTATGGGG TGTTAAAAGC GGCTAAAATG 42 0 

CGTTTAGCGT GGTTTTTTGA CTTTGGTTTC TTAACCTTTA AAACCCCAAC TAGGGGGAGT 48 0 

TTCTTCTATA ACGCTCCCAC CACGACGGCG AATTTTAAAG ATTATGGCGT TGTAGGGGCT 54 0 

GGGTTTGAAA GGGCGACTTG GAGGGCTTCT ACAGGCTTAC AGATTGAATG GATTTCGCCC 600 

ATGGGGCCTT TGGTGTTGAT TTTCCCTATA GCGTTTTTCA ACCAATGGGG CGATGGCAAT 660 

GGCAAAAAAT GTAAAGGGCT GTGCTTTAAC CCTAACATGA ACGATTACAC GCAACATTTT 72 0 

GAATTTTCTA TGGGAACAAG GTTTTAA 747 



(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dovible 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE; NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1008 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 



GTGCAACACT TCAATTTCCT CTATAAAGAT TCTTTATTTT CTATCGCTTT ATTCACTTTC 6 0 
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ATTATCGCTC TTGTGATTTT ATTAGAACAG GCTAGAGCGT ATTTCACCCG AAAGAGAAAC 12 0 

jij^j^j^rj^rj^r^r TGCAAAAATT CGCCCAAAAT CAAAACGCCT ATGCGAGCAG CGAGAATTTA 18 0 

GACGAGCTTT TAAAGCATGC TAAAATTTCC AGTTTGATGT TTTTAGCTAG GGCGTATTCT 240 

AAAGCGGATG TGGAAATGAG CATTGAAATC TTAAAAGGGC TTTTGAATCG CCCCTTAAAA 300 

5 GATGAAGAAA AAATCGCTGT TTTAGATTTA TTGGCTAAAA ATTATTTTAG CGTGGGGTAT 360 

TTGCAGAAAA CAAAAGACAC CGTGAAAGAA ATTTTGCGCT TTTCCCCAAG GAATGTGGAA 42 0 

GCGTTGTTGA AGCTTTTGCA TGCGTATGAA TTAGAAAAAG ATTATTCAAA GGCTTTAGAA 48 0 

ACTTTGGAAT GTTTGGAAGA ATTAGAGGTG CCTAAAATTG AAACGATTAA AAATTACCTC 540 

TATTTAATGC ATTTAATAGA GAATAAGGAA GATGCGGCTA AAATCTTGCA TGTTTCAAAA 6 00 

10 GCGTCGTTAG ATTTGAAAAA AATCGCTCTG AATCACTTAA AATCGCATGA TGAAAATCTT 660 

TTTTGGCAAG AAATTGATAC AACCGAACGG CTAGAAAATG TGATCGATCT TTTATGGGAT 72 0 

ATGAATATCC CTGCTTTTAT TTTAGTUUUU^ CATGCCCTTT TGCAGGACAT CGCGCGATCT 780 

CAAGGGTTGC TTTTGGATCA CAAACCTTGC CAAATTTTTG AATTAGAGGT TTTACGCGCT 840 

CTATTGCATA GCCCTATAAA AGCGAGTCTG ACTTTTGAAT ACCGCTGCAA GCATTGCAAA 900 

15 CAAATCTTTC CTTTTGAAAG CCATAGGTGT CCTGTGTGTT ACCAGTTAGC GTTTATGGAT 960 

ATGGTGCTTA AAATCTCTAA AAAAACGCAT GCTATGGGAG TGGATTAA 1008 

(2) INFORMATION FOR SEQ ID NO: 41: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: circular 

25 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

35 (ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1.,.1242 



40 



45 



50 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



TTATTGGGGT 


GGTTTATGCA 


60 


AAGAGTCTTC 


TTTGGATAAA 


120 


AGGAATTAAA 


GAATAAGGAT 


180 


GAAAACCCAG 


AGCAGAAGTC 


240 


CTCCTAAAAT 


CAAAGGGAGT 


300 


CAAAACCTGA 


AGAAAAAGAT 


360 


CTAGCTCTCA 


ATTCAATTCC 


420 


AAGATAAGGT 


CGTAGGGGGC 


480 


AAATCCAAGA 


AGAGCAAGAA 


540 


TCGCTGAACG 


CATTAAAAAC 


600 


AGCTAGACCA 


AGAAATGGCG 


660 


AACAGATGCT 


TATGGCTGAG 


720 


TAGAAATGCA 


AGAATTGTTG 


780 


CCAAAATGCG 


CGAATATTAC 


840 


AAACCGTGCG 


CTACACTTCC 


900 
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ACCAATCAAG AGGATTTAGA AAGGGCTATG GCAGACCCTA ATTTGGAAGT CCCAGGGGTG 960 

AGTAAGGCCA ATGAAAAAAT AGAGATGAAA ACCCTAAACC CTCAAATCGC CCAAGTCTTT 1020 

ATTTCGCATG AGCAAGGCTC TTTCACGCCC GTTATGAATG GGGGTGGGGG GCAGTTCATC 1080 

ACCTTTTATA TCAAGGAAAA AAGGGGTAAA AATGAAGTGA GCTTCAGTCA GGCCAAGCAA 1140 

TTCATCGCCC AAAAATTAGT GGAAGAATCT AAGGATAAGA TTTTAGAAGA GCATTTTGAA 1200 

AAATTGCGCG TTAAGTCTAG GATTGTGATG ATCAGAGAGT GA 1242 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: circular 

( ii) MOLECULE TYPE : DNA (genomic.) . . 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

ATGATTAAAA GAATTGCTTG TATTTTAAGC TTGAGCGCGA GTTTAGCGTT AGCTGGCGAA 60 

GTGAATGGGT TTTTCATGGG TGCGGGTTAT CAACAAGGTC GTTATGGCCC TTATAACAGC 12 0 

AATTACTCTG ATTGGCGTCA TGGCAATGAC CTTTATGGTT TGAATTTCAA ATTAGGTTTT 18 0 

GTAGGCTTTG CCAATAAATG GTTTGGGGCT AGGGTGTATG GCTTTTTAGA TTGGTTTAAC 24 0 

ACTTCAGGGA CTGAACACAC CAAAACCAAT TTGCTCACCT ATGGCGGCGG TGGCGATTTG 3 00 

ATTGTCAATC TCATTCCTTT GGATAAATTC GCTCTAGGTC TCATTGGTGG CGTTCAATTA 3 60 

GCCGGAAACA CTTGGATGTT CCCTTATGAT GTCAATCAAA CCAGATTCCA GTTCTTATGG 420 

AATTTAGGCG GAAGAATGCG TGTTGGGGAT CGCAGTGCGT TTGAAGCGGG CGTGAAATTC 4 80 

CCTATGGTTA ATCAGGGTAG CAAAGATGTA GGGCTTATCC GCTACTATTC TTGGTATGTG 540 

GATTATGTCT TCACTTTCTA G 561 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 72 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY ; c i r cul ar 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 
5 (A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .729 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

10 ATGAAAAAAT TTTTTTCTCA ATCTTTGTTA GCTCTTATTA TCTCTATGAA TGCGGTATCT 60 

GGCATGGATG GTAATGGCGT TTTTTTAGGG GCGGGTTATT TGCAAGGACA GGCGCAAATG 120 

CATGCGGATA TTAATTCTCA AAAACAAGCC ACCAACGCTA CGATCAAAGG CTTTGACGCG 180 

CTCTTGGGGT ATCAATTTTT CTTTGAAAAA CACTTTGGCT TACGCCTTTA TGGGTTTTTT 24 0 

GACTACGCTC ATGCCAATTC TATTAAGCTT AAAAACCCTA ACTATAATAG CGAAGCGGCG 3 00 

15 CAAGTGGCTA GTCAAATTCT TGGGAAACAA GAAATCAATC GTTTAACAAA CATTGCCGAT 360 

CCCAGAACTT TTGAGCCGAA CATGCTCACT TATGGGGGGG CTATGGACGT GATGGTTAAT 420 

GTCATCAATA ACGGCATCAT GAGTTTGGGG GCTTTTGGCG GGATACAATT GGCCGGCAAT 480 

TCATGGCTTA TGGCGACACC GAGCTTTGAG GGCATTTTAG TGGAACAAGC CCTTGTGAGC 540 

AAGAAAGCCA CTTCTTTCCA ATTTTTATTC AATGTGGGGG CTCGCTTAAG GATCTTAAAA 6 00 

20 CATTCTAGCA TTGAAGCGGG CGTGAAATTC CCCATGCTAA AGAAAAACCC CTACATCACT 6 60 

GCAAAAAATT TGGATATAGG GTTTAGGCGC GTGTATTCGT GGTATGTGAA TTACGTGTTC 72 0 
ACTTTCTAG 



25 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 44; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOLOGY: circular 



729 



(ii) MOLECtJLE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...771 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATGGGATACG CAAGCAAATT AGCTTTAAAG ATTTGTTTGG TAGGTTTATG TTTATTTAGC 6 0 

ACCCTTGGTG CAGAACACCT TGAGCAAAAA GGGAATTATA TTTATAAGGG AGAGGAGGCT 12 0 

TATAATAATA AGGAATATGA GCGAGCGGCT TCTTTTTATA AGAGCGCTAT TAAAAATGGT 180 

50 GAGTCGCTTG CTTATATTCT TTTAGGGATC ATGTATGAAA ATGGTAGGGG TGTACCTAAA 24 0 

GATTACAAGA AAGCGGTTGA ATATTTCCAA AAAGCTGTTG ATAACGATAT ACCTAGAGGG 300 

TATAACAATT TGGGCGTGAT GTATAAAGAG GGTAAGGGAG TTCCTAAAGA TGAAAAGAAA 3 60 

GCGGTGGAAT ATTTTAGAAT AGCTACAGAG AAAGGTTATA CTAACGCTTA TATCAACTTA 42 0 

GGCATCATGT ATATGGAGGG CAGGGGAGTT CCAAGTAACT ATGCGAAAGC GACAGAATGT 4 80 

55 TTTAGAAAAG CGATGCATAA GGGCAATGTG GAAGCTTATA TTCTCCTAGG GGATATTTAT 54 0 
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TATAGCGGGA ATGATCAATT GGGTATTGAG CCGGACAAAG ATAAGGCTGT TGTCTATTAT 600 

AAAATGGCGG CTGATGTGAG TTCTTCTAGA GCTTATGAAG GGTTGTCAGA GTCTTATCGG 660 

TATGGGTTAG GCGTGGAAAA AGATAAAAAA AAGGCTGAAG AATACATGCA AAAAGCATGC 720 

GATTTTGACA TTGATAAAAA TTGTAAGAAA AAGAACACTT CAAGCCGATA A 771 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTKET-I-GAL-: -NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATGAGAAAAC TATTCATCCC ACTTTTATTA TTCAGCGCTT TAGAAGCGAA CGAGAAAAAC 60 

GGCTTTTTCA TAGAAGCCGG CTTTGAAACT GGGCTATTAG AAGGCACACA AACGCAAGAA 12 0 

AAAAGACACA CCACCACAAA AAACACTTAC GCAACTTACA ATTATTTACC CACAGACACG IBO 

ATTTTAAAAA GAGCGGCTAA TTTATTCACC 7UVTGCCGAAG CGATTTCiVAA ATTAAAATTC 24 0 

TCATCTTTAT CCCCTGTTAG AGTGTTGTAT ATGTATAATG GTCAATTAAC TATAGAAAAC 300 

TTCTTGCCTT ATAATTTAAA TAATGTTAAG CTTAGTTTTA CAGACGCTCA AGGCAATGTG 360 

ATCGATCTAG GCGTGATAGA GACTATCCCC AAACACTCTA AGATTGTTTT GCCCGGAGAG 42 0 

GCATTTGATA GTCTAAAAAT TGACCCCTAT ACTTTATTTC TTCCAAAAAT TGAAGCCACT 480 

AGCACTTCTA TTTCTGACGC TAACACGCAG AGGGTGTTTG AAACGCTCAA TAAGATTAAG 54 0 

ACAAATTTGG TCGTAAATTA TAGGAATGAA AACAAATTTA AAGATCACGA AAATCATTGG 600 

GAAGCCTTTA CCCCACAAAC CGCAGAAGAA TTCACTAATT TAATGTTGAA CATGATCGCT 660 

GTTTTAGACT CCCAATCTTG GGGCGATGCG ATCTTAAACG CTCCTTTTGA GTTCACTAAC 720 

AGCCCAACAG ATTGCGATAA TGATCCTTCA AAATGCGTAA ATCCTGGGAC AAACGGGCTT 7 80 

GTCAATTCTA AAGTCGATCA AAAATATGTG TTAAACAAAC AAGACATTGT CAATAAATTT 84 0 

AAAAACAAAG CGGATCTTGA TGTAATTGTT TTAAAGGATT CAGGGGTTGT AGGGCTTGGG 900 

AGTGATATTA CCCCTAGCAA CAATGATGAT GGCAAGCATT ATGGCCAGTT AGGGGTAGTA 960 

GCTTCTGCTT TAGATCCTAA AAAACTCTTT GGCGATAACC TTAAGACTAT CAATTTAGAG 1020 

GATTTAAGAA CCATCTTGCA TGAATTCAGC CACACTAAAG GCTATGGGCA TAACGGGAAT 1080 

ATGACCTATC AAAGAGTGCC GGTAACGAAA GATGGTCAAG TGGAAAAGGA TAGTAATGGC 114 0 

AAGCCAAAAG ATTCTGATGG CCTCCCCTAT AATGTGTGTT CGCTTTATGG GGGATCCAAT 1200 

CAGCCCGCTT TCCCTAGCAA CTACCCTAAT TCCATCTATC ACAATTGTGC GGATGTCCCG 1260 

GCTGGCTTTT TAGGGGTAAC AGCAGCGGTT TGGCAGCAGC TCATCAATCA AAACGCCTTG 13 20 

CCGATCAACT ACGCTAACTT GGGGAGTCAA ACAAACTACA ACCTAAACGC TAGTTTAAAC 13 80 

ACGCAAGATT TAGCCAATTC CATGCTCAGC ACCATCCAAA AAACCTTTGT AACTTCTAGC 144 0 

GTTACCAACC ACCATTTTTC AAACGCATCG CAAAGTTTTA GAAGCCCTAT TTTAGGGGTT 1500 

AACGCTAAAA TAGGCTATCA AAACTACTTT AATGATTTCA TAGGGTTGGC TTATTATGGC 1560 

ATCATCAAAT ACAATTACGC TAAAGCTGTT AATCAAAAAG. TCCAGCAATT GAGCTATGGT 162 0 
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GGGGGGATAG ATTTGTTATT GGATTTCATC ACCACTTACT CCAATAAAAA TAGCCCTACA 1680 

GGCATTCAAA CCAAAAGGAA TTTTTCTTCA TCTTTTGGTA TCTTTGGGGG GTTAAGGGGC 1740 

TTGTATAACA GCTATTATGT GTTGAACJUUV GTCAAAGGAA GCGGCAATTT AGATGTGGCT 18 00 

ACCGGGTTGA ACTACCGCTA TAAGCATTCT AAATATTCTG TAGGGATTAG CATCCCTTTA 1860 

5 ATCCAAAGAA AAGCTAGCGT CGTTTCTAGC GGTGGCGATT ATACGAACTC TTTTGTTTTC 1920 

AATGAAGGGG CTAGCCACTT TAAGGTGTTT TTCAATTACG GGTGGGTGTT TTAG 1974 

(2) INFORMATION FOR SEQ ID NO:46: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

15 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

20 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

25 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...504 



30 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



ATGAAATTGG TGAGTCTTAT TGTAGCGTTA GTTTTTTGTT GTTTTTTAGG GGCTGTAGAG 60 

TTGCCTGGAG TTTATCAAAC TCAAGAATTT TTATACATGA AAAGCTCTTT TGTGGAGTTT 120 

TTTGAGCATA ACGGGAAGTT CTATGCCTAT GGTATTTCTG ATGTGGATGG CTCTAAAGCC 18 0 

AAAAAAGACA AACTCAATCC TAACCCAAAG CTAAGGAATC GCAGCGATAA AGGCGTGGTG 240 

35 TTTTTAAGCG ATTTGATTAA GGTTGGGGAA CAATCTTATA AAGGCGGTAA GGCGTATAAT 300 

TTTTATGACG GCAAGACCTA CCATGTGAGA GTCACTCAAA ATTCAAACGG GGATTTGGAA 360 

TTCACTTCAA GCTATGACAA ATGGGGGTAT GTGGGCAAAA CCTTCACCTG GAAACGCCTG 420 

AGCGATGAAG AAATCAAAAA TCTAAAGCTC AAGCGTTTTA ACTTGGACGA AGTCCTTAAA 4 80 

ACCCTCAAAG ATAGCCCTAT TTAA 



504 



(2) INFORMATION FOR SEQ ID NO: 47; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



55 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY; inisc_f eature 

(B) LOCATION 1. . . 885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

ATGAGTAATC AAGCGAGCCA TTTGGATAAT TTTATGAACG CTAAAAATCC CAAAAGTTTT 60 

TTTGATAATA AGGGGAATAC CAAATTCATC GCTATCACAA GCGGTAAGGG GGGCGTGGGG 12 0 

AAATCCAACA TTAGCGCTAA TTTAGCTTAC TCTTTATACA AGAAAGGTTA TAAGGTAGGG 180 

GTATTTGATG CGGATATTGG TTTAGCGAAT TTAGATGTCA TTTTTGGGGT GAAAACCCAT 240 

AAAAATATCT TGCATGCCTT AAAAGGCGAA GCCAAATTGC AAGAAATCAT TTGCGAGATT 3 00 

GAACCCGGGC TTTGCTTAAT CCCTGGGGAT AGCGGCGAAG AAATTTTAAA ATACATCAGC 360 

GGCGCGGAAG CTTTGGATCG ATTCGTAGAT GAAGAGGGGG TTTTAAGCTC TTTAGATTAT 420 

--ATTGTGATT.G_ATACGGGTGC_^GGGAT:EGGG_QCCACTACGC-AAGCGT-TTT-T-GA^ 480^ 

GATTGCGTGG TGATTGTTAC CACACCCGAT CCTTCAGCGA TTACCGATGC GTATGCATGC 540 

ATTAAAATCA ACTCCAAGAA TAAAGATGAA TTGTTCCTTA TCGCTAACAT GGTAGCCCAA 600 

CCTAAAGAAG GCAGGGCGAC TTATGAAAGG CTATTCAAGG TGGCTAAAAA CAATATCGCT 660 

TCATTAGAAT TGCACTATTT AGGGGCGATT GAAAACAGCT CCTTATTGAA ACGCTATGTG 72 0 

AGGGAGCGAA AGATTTTGAG GAAAATAGCC CCTAACGATT TGTTTTCGCA ATCCATTGAC 78 0 

CAGATAGCGA GCCTTTTAGT TTCTAAACTA GAAACCGGCA CTTTAGAAAT ACCAAAAGAA 84 0 

GGTTTAAAAA GCTTTTTTAA AAGGCTTTTG AAGTATTTGG GGTAG 88 5 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1. . ,1119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

TTGGAACCTT CAAGAAATCG CCTAAAACAT GCCGCCTTTT TTGTGGGGCT TTTTATCGTT 60 

TTGTTTTTAA TTATAATGAA GCACCAAACC TCCCCCTATG CTTTCACGCA TAATCAAGCC 120 

CTTGTCACTC AAACCCCCCC CTATTTCACG CAACTCACTA TCCCTAAACC AAATGACGCT 180 

TTAAGCGCGC ATGCGAGCTC TTTAATCAGC TTGCCTAACG ACAATCTTTT GAGCGCTTAT 240 

TTTAGCGGCA CTAAAGAAGG GGCAAGGGAT GTGAAAATCA GCGCGAATCT TTTTGACAGC 3 00 

AAGACTAATC GCTGGAGCGA AGCCTTCATT CTTTTAACCA AAGAAGAGCT TTCTCATCAT 360 

TCGCATGAAT ACATCAAAAA ATTAGGTAAC CCCTTGCTTT TTTTGCATGA TAATAAAATT 420 

TTGTTGTTTG TCGTAGGGGT . GAGCATGGGC GGGTGGGCCA CTTCTAAAAT CTATCAATTT 4 80 
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GAAAGCGCTT TAGAGCCGAT TCATTTTAAG 
TTAAATTTGA GCCATTTAGT AAGGAATAAG 
CTACCACTCT ATCACGAATT AGCCACCCAA 
AATAACCCAA GAGAGCTTTT AAGGCCTAAT 
ACCCCCTTTA AAGACTGCGC TGTCATGGCG 
ATGCTAGAAA CCTGTAAAAC CCCCACTGAT 
AACTTAGATG ATTCTTTAAA TTTACTCAAT 
CCTAGCGATT TATCACTGCG TCGTAAAGAA 
TCGTTTAAAA CCTTAAAAGT TTTGGATAAA 
CTTAATCCGC ATTTTATAGA TATTGTCTAT 
CGTTTCAATA TGGCTTATTT AAATTCCCTT 



- 124- 

TTTGCGCGAA AACTCTCTTT AAGCCCTTTT 540 

CCTTTAAACA CCACTGATGG CGGGTTTATG 600 

TACCCCTTGT TGTTGAAATT TGACCAACAA 66 0 

ACCTTAAACC ACCAGCTCCA ACCAAGCTTA 72 0 

TTTAGAAACC ATTCTTTTAA AGATAGCCTC 780 

TGGCAAAAAC CCATTTCTAC AAATCTTAAA 84 0 

TTAAATGGAA TATTGTATTT GATCCACAAC 900 

CTTTGGCTTT CTAAATTAGA AAACTCCAAC 96 0 

GCGAATGAAG TGAGTTACCC AAGCTATAGC 102 0 

ACTTACAACC GCTCTCATAT CAAACACATC 1080 

CTCAAGTGA 1119 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 93 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE; 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...2937 



(xi) SEQUENCE DESCRIPTION: SI 

ATGAAGAAAA GAAAACATGT ATCCAAGAAA 
GTATTCACTC TTTTAGTCGT CATTCACAAA 
TTAAAAATTG GAAAACTTGG CATTTCTGAA 
TTGGAAGTTG AGCGGGTTGA TCTCTCTTCT 
TTAGAAGTTT CTGATTTGAT TAAAAATATC 
GAAAAACTTA AAGTCAAAGA AATCATTTTA 
GATGGGAATA AATACGAGTT AGAATTTCCA 
GATAAAAATA TCAAGCTTAA AATCATCAAT 
GATGGCAACG CCCACTATTC ACCCAAAGCC 
CCCTTAGTTG AACCCAGCGC TGCAATTTAT 
GAATTAAAAA TTAACACTTC TCCAATGAAA 
CGCCAATCGC AAAAAAATTT AAAAACGTGG 
AAGATTGATA ACGCTTTAAT CAAGGCTAAT 
TTGGAAAATT CTGTAGTTAA AGCCACTTTG 
TTATCGCCCA TTAAAATGGA TAAAACCGAA 
CAGCCCCAAA AAATCACTTA TGAAACCATG 
AATTTGTTAG AAGCCCCTAA GTTGGAGGTT 
GATAGCATTA AGGATTTATT GAGCGCTTAT 
ATGCCATCTA GCGCGGATTT GAAGCTCACT 
TTTAGCGTTC AAGGCAGCGT TAATTTGCAA 



:Q id NO: 49: 

GTGTTTAATG TCATTATCTT GTTTGTGGCA 6 0 

ACCCTTTCAA ACGGCATTCA CATACAAAAT 120 

TTATACTTAA AACTCAATAA CAAGCTTTCT 180 

TTCTTCCATC AAAAACCCAC TAAAAAGCGT 240 

CGTTATGGCA TTTGGGCGGT GTCTTATTTT 3 00 

GACGATAAAA ATAAAGCCAA TATCTTTTTT 360 

GGAATCAAAG GGGAATTTTC CCTAGAAGAC 420 

TTGCTTTTTA AAGATGTTAA AGTCCAAGTG 48 0 

AGGAAAATGG CGTTCAATTT GATTGTCAAG 54 0 

TTGCAAGGGC TAACCGATTT AAAAACCATA 600 

AGCCTAGCGT TTTTAAAGCC TCTTTTCCAA 660 

ATTTTTGACA AGATCCAATT TGCCAGCTTT 720 

TTCACTCCTA GCGAGTTTAT CCCATCGCTT 780 

ATTAAGCCTT CAGTCGTTTT TAATGATGGC 840 

TTGATTTTCA AT^AACAAACA GCTCCTCATA 900 

GAATTAACCG GCTCTTACGC CACTTTTTCC 960 

TTTTTAAAAA CGACCCCTAA TTATTATGGC 102 0 

AAAGTCGTTT TACCTTTGGA TAAAATCAGC 1080 

TTGCAATTCT TAAAAAACAC CGCCCCCTTA 1140 

GAAGGCACTT TCTCGCTCTA TAATATCCCC 12 00 



wo 98/18323 



PCT/US97/19575 



- 125- 

CTTTACACGC AAAGCGCTCA AATCAATTTG GACATCGCCC AAGAATACCA ATACATCTAC 1260 

ATAGACACGA TCCACACGCG CTATGCAAAC ATGCTGGATT TAGACGCTAA AATCGCTTTA 1320 

GATTTAGGTC AAAAAAACCT TTCTTTGGAT TCTTTAGTCC ATAAAATCCA AGTCAATACC 1380 

AATAACAATA TCAACATGCG CTCTTATGAT CCCAATAACA CTCAAGAAGA TCCGCAAACT 144 0 

AACTTTACTT TGGATCTAAA AAGCTTGCAT TCTATCATTC AAGAGGGTGA AAATTCAGAA 1500 

GTTTTTAGAA GAAAAATCAT AGACACCATT AAAGCCCAAA GCGAAGATAA ATTCACT7\AA 1560 

GATGTTTTTT ACGCCACAGG AGACACTCTC AAAAGCCTGT CGTTGAGTTT TGATTTTTCC 1620 

AACCCCGATC ACATACAATG GAGCGTGCCA CAACTCTTAT TAGAAGGCGA ATTTAAAGAT 1680 

AACGCCTATA CTTTTAAGAT CAAAGATTTG AAAAAGATCA AGCCCTATTC CCCCATTATG 1740 

GACTATATTG CCCTAAAAGA CGGCTCTTTA GAGGTTTCTA CG AGCGATTT TGTCAATATT 1800 

GATTTTTTTG CTAAAGATTT GAAAATCAAC CTCCCCATTT ATAGGAGCGA TGGATCGCAT 1860 

TTTGATTCTT TTTCTTTATT TGGCTCTATC AATAAAGATG AAATTTCTGT CTATACTCCA 1920 

AGCAAAAGCA TATCCATAAA AGTTAAGGGG GATCAAAAGG ATATTACCCT TAATAACATT 1980 

GATTTGAGTA TTGATGATTT CTTGGATAGT AAAATGCCAG CTATTGCGGG ATTATTCTCA 2 04 0 

AAAGAACGAA AAGAAAAGCC TAGCTCTAAA GAAATCCAAG ATGAAGATGT TTTCATTAGC 2100 

-GCeAAAeAAG-GGTATGAAAA-AGGGGASAAA-ATTATSSeeA-TeTeTAeAeG-eATeeATGeT -2-iSO- 

AAAGATGTCG TGCTGATCTA TAAAAAAATG CCTTTTCCTT TAGAAAATCT TGATATTGTC 2220 

GCTCAAGACG ATAGGGTGAA AATTGATGGC AATTATAAAA ACGCCATGAT CATGGCSGAT 2280 

TTAGTGCATG GGGCTTTGTA TCTTT^GGCT CATAATTTTA GCGGGGATTA TATCAACACC 2340 

ATTCTTCAAA AAGATTTCGT AGAAGGAGGC TTATTCACGC TTATTGGGGC TCTTGAAGAT 24 00 

CAGGTTTTCA ATGGCGAATT GAAATTCCAA AACACAAGCT TAAAGAATTT CGCCCTCATG 2460 

CAAAACATGG TCAATCTCAT CAACACCATT CCCTCCCTCA TTGTCTTTAG AAACCCTCAT 2520 

TTAGGGGCTA ATGGCTATCA AATCAAAACC GGCTCCGTTG TGTTTGGGAT CACTAAAGAA 25 80 

TATTTAGGGT TAGAAAAAAT TGATCTTGTC GGCAAAACGC TTGATATTGC TGGCAATGGA 2640 

ATCATTGAAT TAGACAAAAA CAAATTAGAT TTAAACTTAG AAGTTTCCAC TATCAAGGCT 2 7 00 

TTGAGTJ\ATG TCTTAAATAA AATCCCTATC GTGGGCTATC TCGTTTTAGG AAAAGGAGGT 2 760 

AAAATCACCA CTAACGTGAA TGTCAAAGGC ACGTTGGATA AGCCTAAAAC CCAAGTAACT 2820 

TTAGCGTCAG ATATTATCCA AGCGCCTTTT AAAATCTTAC GCCGTATTTT CACGCCTATT 2880 

GACATCATCG TGGATGAAGT CAAGAAAAAC ATTGATTCAA AAAGGAAATT AAAATGA 2937 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .1434 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

ATGAATACTA TTATAAGATA TGCGAGTTTA TGGGGCTTGT GTATTACTCT AACTCTAGCG 60 

CAAACCCCCT CTAAAACCCC TGATGAAATC AAGCAAATCC TTAACAATTA TAGCCATAAG 12 0 
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AATTTAAAGC TCATTGATCC GCCGACAAGT TCTTTAGAAG CGACACCGGG TTTTTTACCC 180 

TCGCCTAAAG AAACAGCGAC CACGATCAAT CAAGAGATCG CTAAATACCA TGAAAAAAGC 240 

GATAAAGCCG CTTTGGGGCT TTATGAATTG CTAAAGGGGG CTACCACCAA TCTCAGTTTG 3 00 

CAAGCGCAAG AACTCAGTGT CAAGCAAGCG ATGAAGAACC ACACCATCGC CAAAGCGATG 360 

TTTTTGCCTA CTTTGAACGC GAGTTATAAT TTTAAAAATG AAGCTAGGGA TACTCCAGAA 420 

TATAAGCATT ATAACACCCA ACAACTCCAA GCTCAAGTCA CATTGAATGT GTTTAATGGC 480 

TTTAGCAATG TGAATAATGT CAAAGAAAAG TCTGCGACTT ACCGATCCAC TGTGGCTAAT 540 

TTAGAATATA GCCGCCAAAG CGTGTATTTG CAAGTGGTGC AACAATACTA CGAGTATTTT 600 

AACAATCTCG CTCGCATGAT CGCTTTGCAA AAGAAATTAG AGCAAATCCA AACGGACATT 660 

AAAAGGGTTA CTAAGCTCTA TGACAAAGGG CTGACCACGA TTGATGATTT ACAAAGCTTA 72 0 

AAAGCGCAAG GGAATTTGAG CGAATACGAT ATTTTGGACA TGCAATTTGC TTTGGAGCAA 780 

AACCGCTTGA CTTTAGAATA CCTCACTAAC CTCAGTGTGA AAAATTTGAA AAAGACCACG 840 

ATTGATGCGC CTAATTTGCA ATTAAGAGAA AGGCAGGATT TGGTTTCTTT AAGGGAGCAG 900 

ATTTCTGCAC TCAGATACCA AAACAAGCAA CTCAATTATT ACCCCAAGAT AGATGTGTTT 960 

GACTCATGGC TTTTTTGGAT CCAAAAACCC GCTTATGCCA CAGGGCGTTT TGGGAATTTC 1020 

TACCCAGGTC AGCAAAATAC GGCTGGGGTT ACTGCGACTT TGAATATTTT TGATGATATA 1080 

GGGTTGAGCT TGCAAAAACA ATCCATCATG CTAGGCCAAT TAGCGAATGA AAAGAATTTA 1140 

GCGTATAAAA AATTGGAGCA AGAAAAAGAC GAACAGCTTT ACAGAAAGTC GCTTGATATT 1200 

GCCAGAGCTA AGATTGAATC TTCAAAGGCT AGTTTGGATG CGGCCAATCT TTCTTTTGCC 1260 

AATATTAAAA GGAAATACGA CGCTAATTTA GTGGATTTCA CTACCTATTT AAGGGGCTTA 1320 

ACCACGCGCT TTGATGCAGA AGTGGCTTAC AATTTAGCGC TCAACAATTA CGAAGTGCAA 13 80 

AAAGCCAATT ACATTTTTAA CAGCGGGCAT AAAATAGACG ACTATGTGCA TTAA 1434 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 3 9 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGIN7U. SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1239 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATGCTATCTT TTATAAGCGC GTTTGATAAA AGGGGCGTTT CAATACGCCT TCTAACAGCC 60 

TTGTTACTGC TTTTTAGTTT GGGTTTGGCT AAAGATTTAG AAATCCAAAC TTTTGTGGCT 120 

AAATACCTTT CTAAAAATCA AAAAATACAA GCCCTACAGG AGCAAATTGA CGCTTTAGAT 180 

TCTCAAGAAA AAGTCGTTAG CAAATGGGAT AACCCTATTT TGTATTTAGG CTATAACAAC 240 

GCTAACGTGA GCGATTTTTT CAGGCTGGAT AGCACCTTAA TGCAAAACAT GAGCTTGGGT 300 

TTGTCTCAAA AAGTGGATTT AAATGGTAAA AAACTCACGC AGTCTAAAAT GATCAATTTA 360 

GAAT^AACAAA AAAAAATATT AGAGCTTAAA AAAACCAAGC AGCAATTGGT GATTAATTTA 42 0 

ATGATAAACG GCATTGAAAA CTATAAAAAC CAACAAGAAA TAGAGCTTTT AAACACAGCG 48 0 

ATTAAAAATT TAGAAAACAC CCTCTATCAA GCCAACCATT CCAGTTCGCC CGATTTAATA 54 0 
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GCGATCGCCA AGTTAGAAAT TTTAAAATCG CTATTAGAAA TCCAAAAAAA CGATTTAGAA 600 

GTAGCGCTCT CTAGCAGCCA TTATTCCATG GGCGAATTGA CTTTTAAAGA AAACGAGATT 660 

TTAAGCATTG CCCCTAAAAA TTTTGAATTC AATAACGAGC AAGAGCTGCA TAACATTAGC 720 

GCCACTAATT ACGATATTGC GATCGCCAGG CTTGATGAAG AAAAAGCACA AAAAGACATG 780 

ACTCTGGCTA AAAAAAGCTT TTTAGAAGAC ATAAACGTTA CCGGGGTGTA TTATTTCCGC 840 

TCCAAACAAT ACTATAACTA CGACATGTTT AGCGTCGCTT TGTCTATCCC TTTACCTCTT 900 

TATGGCAAGC AGGCTAAATT AGTGGAGCAA AAGAAAAAAG AAAGCTTGGC GTTTAAAAGC 960 

GAAGTGGAAA ACGCCAAAAA CAAAACGCGC CACCTGGCCC TAAAACTCCT TAAAAAATTA 1020 

GAAACCTTGC AAAAAAACCT GGAATCGATC AATAAAATCA TCAAACAGAA TGAAAAAATC 108 0 

GCGCAAATTT ATGCGCTTGA TTTGAAAACT AATGGCGATT ACAACGCTTA TTACAACGCC 114 0 

TTGAATGACA AAATCACTAT TCAAATCACC CAGCTTGAAA CCTTAAGCGC TCTAAATAGT 1200 

GCTTATTTGT CCTTACAAAA TCTCAAAGGA TTAGAATGA 1239 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) 'SEQUENCE CHARACTERTSTICST" 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...414 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

ATGCGTATAG TTAGAAATTT ATTTCTTGTA TCGTTTGTGG CGTATAGTAG TGCGTTCGCA 60 

GCGGATTTAG AAACCGGAAC CAAAAACGAC AAAAAGAGCG GTAAAAAATT TTACAAACTC 120 

CATAAAAACC ATGGCTCAGA AACCGAGACT AAAAACGATA AAAAGCTTTA TGATTTCACT 180 

AAAAATAGCG GATTAGAAGG CGTGGATTTA GAAAAAAGCC CTAACCTTAA AAGCCATAAA 240 

AAAAGCGATA AAAAGTTTTA TAAACAACTC GCTAAAAACA ATATCGCTGA AGGGGTGAGC 3 00 

ATGCCGATTG TGAATTTCAA TAAAGCCCTA TCTTTTGGGC CTTATTTTGA AAGGACTAAA 360 

AGCAAAAAAA CCCAATACAT GGACGGCGGG TTGATGATGC ACATCCGTTT TTAA 414 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 



TTGATGCCAC AAAACCAGCT TGTGATCACC ATCATTGATG AATCAGGCTC TAAGCAACTC 60 

AAATTTTCTA AAAATTTAAA ACGCAACCTC ATCATTTCTG TTGTCATTCT TTTATTGATC 120 

15 GTGGGGCTTG GCGTGGGGTT TTTAAAATTT TTAATCGCTA AAATGGATAC GATGACAAGC 180 

GAGAGGAATG CGGTTTTAAG GGATTTTAGG GGTTTGTATC AAAAAAATTA CGCCCTAGCG 240 

AAAGAGATTA AAAACAAGCG AGAAGAGCTT TTTATTGTGG GGCAAAAGAT CCGTGGGCTA 3 00 

GAATCCTTGA TTGAAATCAA AAAGGGGGCT AATGGGGGAG GGCATCTCTA TGATGAAGTG 3 60 

GATTTAGAAA ATTTGAGCTT AAATCAAAAA CATTTAGCAC TCATGCTCAT TCCTAATGGC 42 0 

20 ATGCCCCTAA AAACTTATAG CGCTATCAAA CCCACTAAAG AAAGGAACCA CCCCATTAAA 480 

AAGATTAAGG GCGTTGAATC CGGGATCGAT TTTATCGCGC CATTGAACAC GCCTGTGTAT 54 0 

GCGAGCGCTG ATGGGATTGT GGATTTTGTG AAGACTCGTT CTAATGCGGG GTATGGGAAC 600 

TTGGTGCGCA TTGAACATGC GTTTGGTTTC AGCTCCATTT ATACGCACTT AGATCATGTC 660 

AATGTGCAGC CTAAAAGCTT CATCCAAAAA GGGCAGTTGA TTGGCTATAG CGGGAAGAGC 720 

25 GGTAATAGCG GCGGCGAAAA ATTGCATTAT GAAGTGCGGT TTTTGGGTAA AATTTTAGAC 780 

GCAGAAAAAT TCCTAGCATG GGATTTGGAT CATTTTCAAA GCGCTTTAGA AGAAAATAAA 840 

TTTATTGAAT GGAAGAATCT GTTTTGGGTT TTAGAAGACA TCGTCCAGCT CCAAGAGCAT 900 

GTGGATAAAG ACACCTTAAA AGGTCAGTAG 930 

30 (2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

40 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...999 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



GTGCTATATT TTTTAACCAG TTTATTTATT TGCTCTTTGA TTGTTTTGTG GTCTAAAAAA 60 
TCCATGCTCT TTGTGGATAA CGCTAATAAA ATCCAAGGCT TCCATCATGC AAGAACCCCA 12 0 
55 CGAGCCGGGG GGCTTGGGAT CTTTCTTTCT TTTGCGTTGG CTTGTTATCT TGAACCTTTT 18 0 
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GAGATGCCTT TTAAGGGGCC TTTTGTTTTC TTAGGGCTAT CGCTAGTGTT TTTGAGCGGT 24 0 

T^TTTTAGAAG ACATTAACCT TTCATTAAGC CCCAAAATAC GCCTTATTTT GCAAGCTGTA 300 

GGGGTCGTTT GCATCATTTC ATCAACGCCT TTAGTGGTGA GCGATTTTTC GCCCCTTTTT 360 

AGCTTGCCTT ATTTCATCGC TTTTTTATTC GCTATTTTTA TGCTGGTGGG TATCAGTAAC 420 

GCTATTAATA TCATTGACGG GTTTAACGGG CTTGCATCTG GGATTTGCGC GATCGCGCTT 480 

TTAGTCATTC ATTATATAGA CCCTAGCAGT TTGTCTTGTT TGCTCGCTTA CATGGTGCTT 540 

GGGTTTATGG TGTTAAATTT CCCTTCAGGA AAGATTTTTT TAGGCGATGG GGGGGCGTAT 600 

TTTTTGGGTT TGGTGTGCGG GATTTCTCTC TTGCATTTGA GTTTGGAGCA AAAAATCAGC 660 

GTGTTTTTTG GGCTCAATTT AATGCTTTAT CCGGTCATAG AGGTGCTTTT TAGTATCCTT 720 

AGGCGCAAAA TAAAACGCCA GAAAGCCACC ATGCCGGATA ATTTGCATTT GCACACCCTT 78 0 

TTATTTAAAT TCTTGCAACA ACGCTCTTTC AATTACCCTA ACCCTTTATG CGCGTTTATC 840 

CTTATTCTAT GCAACCTGCC TTTTATTTTA ATAAGCGTTT TGTTTCGCTT GGACGCTTAT 900 

GCGCTCATTG TGATTAGCCT AGTCTTTATC GCATGCTATT TAATAGGCTA TGCTTATTTG 960 

AATAGGCAAG TTTGCGCTTT AGAAAAGCGG GCGTTTTAA 999 

-{-2 ) -I-NFGRMAT-I-GN--FOR^ SEQ -XEJ NG :-5 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

ATGAACATAT TCAAGCGTAT TATTTGCGTA ACCGCTATTG TTTTAGGTTT TTTTAACCTT 60 

TTAGACGCCA AACACCACAA AGAAAAAAAA GAAGACCACA AAATCACTCG TGAGCTTAAA 120 

GTGGGCGCTA ACCCTGTGCC GCATGCGCAA ATCTTGCAAT CAGTTGTGGA TGATTTGAAA 180 

GAGAAAGGGA TCAAATTAGT GATCGTGTCT TTTACGGATT ATGTGTTGCC TAATTTAGCG 240 

CTCAATGACG GCTCTTTAGA CGCGAATTAC TTCCAGCACC GCCCTTATTT GGATCGGTTT 3 00 

AATTTGGACA GAAAAATGCA CCTTGTTGGT TTGGCCAATA TCCATGTGGA GCCTTTAAGA 360 

TTTTATTCTC AAAAAATCAC AGACATTAAA AACCTTAAAA . AAGGCTCAGT GATTGCTGTG 420 

CCAAATGATC CGGCCAATCA AGGCAGGGCG TTGATTTTAC TCCATAAACA AGGCCTTATC 4 80 

GCTCTCAAAG ACCCAAGCAA TCTATACGCT ACGGAGTTTG ATATTGTCAA AAATCCTTAC 54 0 

AACATCAAAA TCAAACCCCT AGAAGCTGCG TTATTGCCTA AGGTTTTAGG GGATGTGGAT 600 

GGGGCTATCA TAACAGGGAA TTATGCCTTG CAAGCAAAAC TCACCGGAGC CTTATTTTCA 660 

GAAGATAAGG ACTCGCCTTA TGCTAATCTT GTAGCCTCTC GTGAGGATAA TGCGCAAGAT 720 

GAAGCGATAA AAGCGTTGAT TGAAGCCTTA CAGAGCGAAA AGACCAGGAA ATTCATTTTG 780 

GATACCTATA AGGGGGCGAT TATCCCGGCT TTTTAA 816 

(2) INFORMATION FOR SEQ ID NO: 56: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1,..951 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

ATGCAAGAAT TCAGTTTGTG GTGCGATTTT ATAGAAAGGG ATTTTTTAGA AAACGATTTT 60 

TTAAAGCTCA TCAATAAGGG GGCTATTTGC GGGGCGACGA GTAACCCTAG TTTGTTTTGC 12 0 

GAAGCGATCA CAAAAAGCGC GTTTTATCAA GATGAAATCG CTAAACTCAA AGGCAAAAAA 18 0 

25 GCTAAAGAJA TTTATGAAAC TCTGGCACTA AAGGATATTT TACAAGCCTC TAGCGCGTTA 24 0 

ATGCCTTTGT ATGAAAAAGA CCCTAACAAC GGCTACATCA GCCTAGAAAT TGACCCCTTT 300 

TTAGAAGACG ATGCGATTAA AAGCATTGAT GAAGCCAAGC GGTTATTCAA AACATTAAAC 360 

CGCCCCAATG TGATGATTAA AGTCCCGGCG AGTGAAAGCG CTTTTGAAGT CATTAGCGCT 420 

CTGGCTCAAG CCTCTATCCC CATTAATGTA ACTTTAGTCT TTTCGCCTAA AATTGCCGGT 4 80 

30 GAAATCGCTC AAATCTTAGC CAAAGAAGCA CGAAAAAGAG CGGTCATTAG CGTGTTTGTC 54 0 

TCACGATTTG ACAAAGAAAT AGACCCACTA GTGCCACAAA ATTTGCAAGC TCAAAGTGGG 600 

ATCATGAACG CTACCGAGTG TTATTATCAA ATCAACCAGC ATGCTAATAA GCTAATAAGC 66 0 

ACCCTTTTTG CATCCACCGG CGTTAAATCT AATTCTTTAG CTAAAGATTA CTACATTAAA 72 0 

GCGCTGTGTT TTAAAAACTC TATCAACACA GCCCCCCTAG ACGCCCTAAA CGCTTATTTG 780 

35 CTTGACCCAA ACACCGAGTG TCAAACCCCT TTAAAAATCA CAGAAATTGA AGCGTTCAAA 84 0 

AAAGAATTAA AAACGCACAA TATTGATTTA GAAAACACCG CCCAAAAACT CCTTAAAGAA 900 

GGCTTGATAG CGTTCAAACA ATCCTTTGAA AAGCTTTTAA GCAGTTTTTG A 951 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



55 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION l..,783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ATGAAAACAA ATGGTCATTT TAAGGATTTT GCATGGAAAA AATGCTTTTT AGGCGCGAGC 60 

GTGGTGGCTT TATTAGTGGG GTGTAGCCCG CATATTATTG AAACCAATGA AGTTGCTTTG 12 0 

AAATTGAATT ACCATCCAGC TAGCGAGAAA GTTCAAGCGT TAGATGAAAA GATTTTACTT 18 0 

TTAAGGCCAG CTTTCCAATA CAGCGATAAT ATTGCTAAAG AGTATGAAAA CAAATTCAAG 240 

AATCAAACCA CGCTTAAAGT TGAAGAGATC TTGCAAAATC AGGGCTATAA GGTTATTAAT 3 00 

GTGGATAGCA GCGATAAAGA CGATTTTTCT TTTGCGCAAA AAAAAGAAGG GTATTTGGCT 360 

GTCGCTATGA ATGGCGAAAT TGTTTTACGC CCCGATCCTA AAAGGACCAT ACAGAAAAAA 42 0 

TCAGAACCCG GGTTATTATT CTCCACTGGT TTGGATAAAA TGGAAAGGGT TTTAATCCCG 48 0 

GCTGGGTTTG TCAAGGTTAC CATACTAGAG CCTATGAGTG GGGAATCTTT GGATTCTTTT 54 0 

. ACGATGGATT— TGAGCGAGTT-GGACATCCA-A GAAAAAT-TGT-^ -TAAAAAGGAC-GGATTeAAGe o 0 0 

CATAGCGGAG GGTTAGTTAG CACTATGGTT AAGGGGACGG ATAATTCTAA TGACGCAATT 660 

AAGAGCGCTT TGAATAAGAT TTTTGCAAGT ATCATGCAAG AAATGGATAA GAAACTCACT 72 0 

CAAAGGAATT TAGAATCTTA TCAAAAAGAC GCCAAGGAAT TAAAAAACAA GAGAAACCGA 78 0 

TAA 783 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...4149 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

TTGAATTTTA ATAACCTTAC GGCTAATGGG GCGTTAAATT TTAATGGTTA TGCGCCCTCT 60 

TTAACTAAGG CTTTAATGAA TGTCAGCGGG CAGTTTGTTT TAGGGAATAA TGGGGATATT 120 

AATTTATCTG ACATCAATAT CTTTGACAAC ATCACAAAAT CTGTAACTTA CAACATCTTA 180 

AACGCTCAAA AAGGGATTAC TGGCATTAGT GGGGCTAATG GCTATGAAAA AATCCTTTTT 24 0 

TATGGCATGA AAATCCAAAA CGCTACCTAT AGCGATAATA ACAACATCCA AACTTGGTCG 300 

TTTATAAACC CTCTCAATTC TTCTCAAATC ATTCAAGAGA GCATTAAAAA TGGGGATCTA 360 

ACCATAGAAG TTTTAAATAA CCCTAACTCG GCTTCCAACA CTATTTTTAA TATCGCTCCT 420 

GAGCTTTATA ATTACCAAGA TTCTAAGCAA AATCCTACCG GCTATAGCTA TGATTATAGC 480 

GACAATCAAG CAGGCACTTA TTACTTGACA AGCAACATTA AAGGTCTTTT CACCCCTAAA 540 

GGCTCTCAAA CGCCTCAAAC CCCAGGCACT TATAGCCCAT TTAACCAGCC TTTGAATAGT 600 

TTGAATATCT ACAATAAGGG TTTTTCTAGC GAGAATTTAA AAACGCTTTT AGGGATCCTT 66 0 
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TCTCAAAATT CCGCCACCTT AAAAGAAATG 
AACATTAATG AAGTGTTGCA ACTCTTAGAT 
GCGCTCCTAG AAACGATCAA CCATTTGACT 
AATCTCGTTA TAGGCGCTAC CCAAGATAAT 
5 GGGGGCAATG GCTATAGCAG CCCTTGCGCG 
AACACTTACT TGGGGCAATT ATTAGGCTCA 
GATTTTAAAG CTAAAAGCAT TTATATTACC 
AGCGGAGGGA GCGCGGATGT AACCTTTCAA 
AACATAGAAG CTCAAGCCAC AGACAATATC 
10 AAAATCTTTA ATCAGGGGAA TTTAGCGAAT 
AAGCAAGCCG GCGGTTTAGG GAACTTTATA 
TTACCCGCTA GCTTGCAAGA TGAAACCTTA 
GATTTATTGA ATAATAGTGG AGTCATGAAT 
AGCATTTTTG GCAATTTTGT TACCCCATCC 
15 TTAAAAAGCA TGCTAGACGA TAAAGGGCTT 
TCTGAATTAA GCTCTATTTT AGGCGTGATT 
CTGCAAAAAG ACATTGGTGT GGTAGCGAAC 
GTTGTCAAAA AGCTAGAAAG TCAAGGCTTG 
CAAGGCGGGT TGAGCGGCGT TTATAATCAA 
20 CAAAACGCGC TCAAAGAAAA CGATTTAGGC 
TTTTGGCAAA AAGGGTATTT TAACTTTTTA 
TCTTTTAGTA ACGCTACTGG GGGTAGTTTG 
AATGGCGATA ATACGATTGA CTTTAGCAAG 
GGTGTTTCTA ATATCAATAT CACCACCCTA 
25 GGTTTGAATA ATGTGAGCGT TCAAAAAGGA 
ACAACCAAAA ACAGCTCTCC TGCAAACTCT 
GTGCACGCTA ATAATTTCAC TTTCTTAGGC 
TCTCAAGTAA CAAATAATAG CGTTATAGGC 
CAAGCTAATA ATTTAACGAT CACCAACGCT 
30 ATTGATGGTA ATTTCACCTT AAACCAACAA 
AATGTCATGG GGAATTTTAA TAGCTATGGC 
AGTCATGCTA TTATCAATAC TCAAGGCACA 
ATCCAATTCA ACGCTTCTTC AAAAGAAGTG 
GCCATTTATT ACGGGTATAA CAACCAAATC 
35 AAGCTTTATG CGCTCATTGA TATTAATGGC 
ACCTATAACG GGCAAGCCGT GAGCGTTAAA 
TCTCAAAATC AATACATTTA CACTTCCATT 
AATGATCCTA TCAATAACCC ACAAGCCCCC 
GGCGTTCAAA GCGTGGATAG CATCGATCAA 
40 AATAAAATCT TTGAAACTAA AGGAAGCCCT 
TCCACAAAAG ATTTAACCAC GATCGCTGGA 
AACCCTAATT TTAAAAATGA CGCCACTAAT 
ATGAGTCGTT TAGCCAAGCT CTCTGACACT 
CGCTTAGAAG CCCTTAAAAA CAAGCGATTC 
45 ATTTTAAAAT ACTCTCAAAG GAATAGAGTT 
GGGGCTAGTT TCATTAGTGG AGGTACTGGA 
AGGTTTATTA AGGGCGTGAT TGTGGGAGGT 
GCAAACATCA CTCAATCAGG CTCTAGCAAT 
ATCAAAAGAA GCGAGCTAAC CATGAGCTTG 
50 ATCAACTCCT ATGACCCCCT ACTCTCAATC 
ACGACTGACG CTAAAATCAA TTATGGCTAT 
TTTAAACCCC AAGTAGGCTT AAGCTATTAT 
ATGGATGATC CTATTTACAA CCAATTCAGA 
CTAACGATCA ATTTTGCCCT AGAAAGTCGG 
55 GTGATTGCGG ATGTGGGCAG AGACTTATTC 
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ATTGAATCCA ACCAACTAGA CAATATCACT 720 

AAGATTAAAA TCACCCAAGC GCAAAAGCAA 780 

GACAACATCA ATCAAACCTT TAATAACGGG 840 

GTTACAAACT CTACTAGCTC TATATGGTTT 900 

CTAGATAGCG CCACTTGTTC TTCTTTTAGA 960 

ACTTCCCCTT ATTTAGGCTA CATTAACGCT 102 0 

GGGACAATTG GAAGTAGTAA CGCTTTTGAA 1080 

AGCGCTAATA ACTTAGTGTT GAATAAAGCT 1140 

TTTAATCTTT TGGGTCAAGA AGGGATTGAT 1200 

GTTCTTAGTC AAATGGCTAT GGAAAAAATC 1260 

GAAAACGCTC TAAGCCCTTT GAGTAAGGAA 1320 

GGCCAACTTA TAGGTCAAAA TAACTTAGAT 13 80 

GAAATCCAAA ACATTATCAG TCAAAAACTA 1440 

ATCATAGAAA ACTACCTTGC TAAGCAGTCT 1500 

TTGAATTTTA TCGGTGGGTA TATAGACGCT 1560 

TTAAAGGATA TTACTAACCC CCCTACAAGC 1620 

GACTTGTTGA ACGAGTTTTT AGGACAAGAT 168 0 

GTGAGTAATA TCATCAATAA TGTTATTTCT 1740 

GGTTTAGGGA GCGTGTTGCC GCCCTCTTTA 1800 

ACTCTTTTAT CGCCTAGAGG CTTGCATGAT 18 60 

AGCAATGGCT ATGTTTTTGT CAATAACAGC 1920 

AATTTTGTCG CCAACAAGTC TATTATCTTT 1980 

TATCAAGGCG CATTGATTTT TGCTTCTAAT 2040 

AACGCCACTA ATGGCTTAAG CCTTAATGCG 2100 

GAAATTTGTA TCAATTTAGC CAATTGCCCT 2160 

AGCGTAACCC CCACTAATGA GTCTTTAAGC 2220 

ACAATCATCT CTAATGGGGC TATTGATTTG 2280 

ACGCTCAATC TCAATGAAAA TGCGACCTTG 2340 

TTTAACAACG CCTCTAACTC TACGGCTAAT 2400 

GCGACTTTAA GCACTAACGC TAGTGGTTTG 2460 

GATTTGGTGT TTAACCTCAG TCATTCAGTT 2520 

GCGACGATCA TGGCCAATAA TAACCCTTTG 2580 

GGTACTTACA CGCTGATTGA TAGCGCTAAA 2640 

ACAGGAGGCA GTAGCCTGGA TAATTACCTT 2 700 

AAGCACATGG TGATGACTGA CAACGGCTTA 2 760 

GATGGCGGTT TAGTTGTAGG CTTTAAGGAC 2820 

CTTTATAATA AAGTGAAAAT CGCTGTTTCT 2 880 

ACTTTAAAAC AATATATCGC TCAAATTCAG 2940 

GCTGGGGGAA ATCAAGCGAT TAATTGGCTC 3 000 

TTATTCGCTC CCTATTATCT AGAGAGCCAC 3 060 

GATATTGCTA ACACTTTAGA AGTCATCGCT 3120 

ATTTTACAGA TCAACACCTA CACGCAGCAA 3180 

TCAACTTTCG CCCGTTCTGA TTTCTTAGAA 3240 

GCTGATGCGA TCCCTAACGC TATGGATGTG 33 00 

AAAAATAATG TGTGGGCGAC AGGAGTTGGA 3 360 

ACTTTATATG GTATCAATGT AGGGTATGAT 3420 

TATGCCGCTT ATGGGTATAG CGGGTTCCAT 3480 

GTCAATGTGG GCGTTTATAG CCGAGCGTTT 3540 

AATGAGACTT GGGGATACAA TAAAACTTTC 3600 

ATCAATCAGT CTTACAGATA CGACACTTGG 3660 

GATTTCATGT TTAAAGATAA AAGCGTTATT 372 0 

TACATTGGTT TGTCTGGTTT AAGGGGCATT 3 780 

GCCAATGCTG ACCCTAATAA AAAATCCGTT 3 84 0 

CATTATTTCA ATAAAAACTC TTATTATTTT 3 900 

ATTAATTCTA TGGGGGATAA AATGGTGCGT 3 960 
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TTCATCGGTA ATAACACCCT AAGCTATAGA GATGGTGGCA GATACAACAC TTTTGCTAGC 4 02 0 

ATTATCACAG GCGGGGAGAT AAGATTGTTC AAAACCTTTT ATGTGAATGC GGGCATAGGG 4 080 

GCTAGGTTTG GGCTTGATTA TAAAGATATT AATATTACCG GAAATATTGG TATGCGCTAT 4140 

GCTTTTTAA 414 9 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

_(.iii)_HYP.OTHETICAL.: -NO- - _ „ 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...789 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

ATGAAAAAAA TTGGTTTGAG CTTGTGTTTG GTTTTGAGTT TGGGTTTTTT AAAAGCCCAT 60 

GAAGTGAGCG CTGAAGAGAT TGCGGATATT TTCTACAAAC TCAACGCCAA AGAGCCTAAA 120 

ATGAAAATCA ACCACACGAA GGGGTTTTGC GCTAAAGGCG TGTTCCTCCC TAACCCGCAA 180 

GCAAGAGAGG ATTTAGAGGT GCCACTACTC AATGAAAAAG AAATCCCTGC GTCTGTAAGG 240 

TATTCTTTAG GGGGCGTGGC GATGGACGAT AAAAGCAAGG TTAGGGGAAT GGCGTTAAAA 3 00 

CTAGAAAATC AAAACGCTAG TTGGACAATG GTGATGCTCA ATACAGAAAT CAATTTTGCC 3 60 

AAAAACCCTG AAGAATTCGC CCAATTTTTT GAAATGAGAC TTCCTAAAAA TGGCAAGGTA 42 0 

GATGAAGCAA GAATCAAAAA GCTTTACGAA GAAGTCCCCT CTTATAGGAA TTTTGCCGCC 480 

TATATGAAAA CGATAGGGAT TAGCTCAAGC GTGGCTAATA CGCCTTATTA TAGCGTGCAT 540 

GCGTTCAAGT TTAAAGATAA GAAAGAAAAA TTATTGCCTG CGAGGTGGAA ATTTGTGCCT 600 

AAAGAGGGCG TTAAATACTT AAATCCTCAA GAATTAAAGC AAAAAGATTC AAATTATCTG 660 

CTCTCTTCAT TCCAACAACA CCTTAAAAAT AAACCCATAG AATACCAAAT GTATTTGGTG 720 

TTTGCGAATC AAAATGATGC CACCAACGAC ACGACCGCGC TTTGGAAAGG CAGCATAAGG 780 

AATTATTAG 789 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
< D ) TOPOLOGY : c i r cul ar 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 



741 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...741 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATGAAACAAT TTAAAAAGAA ACCAAAAAAG ATAAAACGAT CGCATCAAAA TCAAAAAACA 60 

ATCTTAAAGC GTCCTTTATG GCTTATGCCT TTACTGATTG GCGGGTTTGC TAGTGGGGTG 120 

TATGCGGATG GAACAGACAT TTTGGGGCTT AGTTGGGGGG AAAAAAGCCA AAAGGTATGC 180 

GTGCATCGTC CATGGTATGC TATATGGAGT TGCGATAAAT GGGAGGAAAA AACACAACAA 240 

TTTACAGGAA ACCAACTCAT CACAAAAACT TGGGCAGGGG GTAATGCGGC TAACTACTAC 300 

CACTCTCAAA ACAACCAAGA CATCACAGCC AATTTAAAAA ATGATAACGG CACTTATTTT 360 

TTAAGCGGTC TGTATAACTA CACCGGAGGG GAATATAATG GGGGGAATTT AGACATTGAA 420 

TTAGGCAGTA ACGCTACTTT TAATCTAGGT GCGAGTAGTG GGAATAGCTT CACTTCTTGG 480 

TATCCTAATG GGCATACTGA TGTTACTTTT AGCGCTGGGA CTATCAATGT GAATAACAGC 54 0 

GTAGAAGTGG GCAATCGTGT GGGATCGGGA GCTGGCACGC ACACCGGCAC AGCCACTTTA 600 

AACTTGAACG CTAATAAGGT TACTATCAAT TCCAATATCA GCGCGTATAA AACTTCGCAA 660 

GTGAATGTAG GCAATGCTAA CAGCGTTATT ACCATTAATT CGGTTTCTTT AAATGGGGAA 72 0 
TACTTGCAGT TCTTTAGCTA G 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 73 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 ... 738 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



60 
120 



ATGATAAAAA AGACCCTTGC ATCGGTTTTA TTAGGATTGA GTTTGATGAG TGTGTTAAAT 
GCCAAAGAAT GCGTTTCGCC CATAACAAGA AGCGTTAAGT ATCATCAGCA AAGTGCTGAG 

ATCAGAGCCT TGCAATTACA AAGTTACAAA ATGGCGAAAA TGGCGCTAGA CAATAACCTT ISO 

AAGCTCGTTA AAGACAAAAA GCCAGCCGTC ATCTTGGATT TAGATGAAAC CGTTTTGAAC 240 

ACTTTTGATT ATGCGGGCTA TTTAGTCAAA AACTGCATTA AATACACCCC AGAAACTTGG 3 00 

GATAAATTTG AAAAAGAAGG CTCTCTTACG CTCATTCCTG GAGCGCTAGA CTTTTTAGAA 360 

TACGCTAATT CTAAGGGCGT TAAGATTTTT TACATTTCTA ACCGCACCCA AAAAAATAAG 420 
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GCATTCACTT TAAAAACGCT CAAAAGCTTT AAGCTCCCCC AAGTGAGTGA AGAATCCGTT 4 80 

TTGTTAAAGG AAAAAGGCAA GCCTAAAGCC GTTAGGCGGG AGTTAGTCGC TAAGGATTAT 54 0 

GCGATTGTTT TACAAGTGGG CGACACTTTG CATGATTTTG ACGCCATTTT TGCTAAAGAC 600 

GCTAAAAACA GCCAAGAACA ACAAGCCAAA GTCTTGCAAA ACGCTCAAAA ATTCGGCACA 660 

GAATGGATCA TTTTACCCAA CTCTCTTTAT GGCACATGGG AAGATGGGCC TATAAAAGCA 720 

TGGCAAAATA AAAAATAA 73 8 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 

(B) TYPE: nucleic acid 

(C) STRTUTOEDNESS : double 

(D) TOPOLOGY: circular 

4ii)— MOLECULE-TYPE:— DNA-CgGnorrri^^ - 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

TTGTGGTGTT TAAAAACCCC TATCATAGGG CATGGCATGA AGAAAAAAGC AAAAGTCTTT 60 

TGGTGTTGTT TTAAAATGAT TCGTTGGTTG TATTTGGCGG TCTTTTTTTT GTTGAGCGTA 120 

TCAGACGCTA AAGAAATCGC TATGCAACGA TTTGACAAAC AAAACCATAA GATTTTTGAA 180 

ATCCTTGCGG ATAAAGTGAG CGCCAAAGAC AATGTGATAA CCGCCTCAGG GAATGCGATC 240 

CTATTGAATT ATGACGTGTA TATTCTAGCG GATAAGGTGC GTTATGACAC CAAGACTAAA 3 00 

GAAGCGTTAT TAGAAGGCAA TATTAAGGTT TATAGGGGCG AGGGCTTGCT CGTTAAAACC 3 60 

GATTATGTGA AATTGAGTTT GAACGAAAAA TATGAGATCA TTTTCCCCTT TTATGTCCAA 420 

GACAGCGTGA GCGGGATTTG GGTGAGCGCG GATATTGCTA GCGGGAAGGA TCAAAAATAT 4 80 

AAGATTAAAA ACATGAGCGC TTCAGGGTGC AGCATTGACA ACCCCATTTG GCATGTCAAT 540 

GCGACTTCAG GCTCATTTAA CATGCAAAAA TCGCATTTGT CAATGTGGAA TCCTAAGATT 6 00 

TATGTCGGCG ATATTCCTGT ATTGTATTTG CCCTATATTT TCATGTCCAC GAGCAATAAA 660 

AGAACTACCG GGTTTTTATA CCCTGAGTTT GGCACTTCCA ACTTAGACGG CTTTATTTAT '720 

TTGCAACCCT TTTATTTAGC CCCCAAAAAC TCATGGGATA TGACCTTTAC CCCACAAATC 780 

CGTTACAAAA GGGGTTTTGG CTTGAATTTT GAAGCGCGCT ACATCAACTC TAAGACGCAG 840 

GTTTTTATTC AATGCGCGCT ATTTTAG 867 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE': DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1. . .387 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

TTGATGTTTA AAAAAATGTG TTTGAGCCTG CTAATGATAA GCGGTGTTTG TGTGGGGGCA 60 

AAGGATTTGG ATTTCAAGCT GGATTATCGC GCGACTGGGG GGAAATTCAT GGGGAAAATG 120 

ACGGACTCTA GTCTTTTAAG TATCACTTCT ATGAACGATG AACCGGTGGT GATTAAAAAC 180 

CTTATTGTCA ATAGGGGAAA TTCATGCGAA GCGACTAAAA AAGTAGAACC CAAATTTGGC 24 0 

GATAAGTTTA AAAAAGAAAA ACTCTTTGAT CATGAATTAA AATACTCGCA ACAGATATTT 3 00 

TACCGCCTGG ATTGCAAGCC TAACCAATTG TTAGAAGTTA AAATCATCAC GGACAAGGGC 360 

GAATATTACC ATAAATTTTC CAAATAG 387 

(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : ci r cular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...S10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 : 

ATGCAAGCGT TAAAATCATT GCTTGAAGTG ATTACAAAAC TCCAGAATCT AGGCGGCTAT 60 

TTGATGCATA TAGCTATTTT CATCATTTTT ATTTGGATTG GAGGGCTTAA GTTTGTGCCT 120 

TACGAAGCTG AAGGGATCGC CCCTTTTGTG GCCAACTCCC CTTTCTTTTC TTTCATGTAT 180 

AAATTTGAAA AACCTGCATA CAAACAACAC AAAATGTCTG AATCCCAATC CATGCAAGAA 240 

GAAATGCAAG ATAACCCTAA AATCGTTGAA AACAAAGAAT GGCATJ\AAGA AAACCGCACT 3 00 

TATTTAGTGG CTGAAGGTTT AGGGATTACG ATCATGATCC TAGGCATTTT GGTGCTTTTG 360 

GGGCTTTGGA TGCCTTTAAT GGGCGTAGTT GGGGGCTTGC TTGTCGCTGG AATGACGATC 420 

ACCACCCTAT TCTTTTTTAT TCACAACGCC AGAAGTGTTT GTCAATCAGC ATTTCCCATG 4 80 

GCTTTCTGGG GCTGGAAGGC TAGTGGTTAA 510 



9818323A1_I_> 
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(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

-(-A-)— GRGAWI-SM-:--'He-l-icobacter -pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

ATGATTGAAT GGATGCAAAA TCATAGAAAG TATTTAGTGG TTACGATATG GATAAGCACG 60 

ATCGCTTTTA TTGCCGCCGG AATGATAGGT TGGGGGCAAT ACAGCTTTTC TTTAGATAGC 120 

GATAGCGCTG CCAAAGTGGG ACAGATTAAG ATTTCTCAAG AAGAATTAGC CCAAGAATAC 18 0 

CGCCGCCTTA AAGACGCCTA TGCTGAGTCT ATCCCTGATT TTAAAGAACT CACCGAAGAT 24 0 

CAAATCAAAG CCATGCATTT AGAAAAAAGC GCGCTAGATT CGCTCATCAA TCAAGCTTTA 3 00 

TTGAGGAATT TCGCTTTAGA TTTAGGGCTT GGTGCTACCA AGCAAGAAGT GGCCAAAGAG 360 

ATCAGAAAAA CGAACGTTTT TCAAAAAGAT GGCGTTTTTG ATGAAGAATT GTATAAAAAT 420 

ATCTTAAAAC AAAGCCATTA CCGCCCCAAG CATTTTGAAG AAAGCGTTGA AAGGCTTTTA 4 80 

ATCCTTCAAA AAATCAGCGC TCTATTCCCC AAAACCACCA CCCCTTTGGA GCAATCCAGT 54 0 

CTATCGCTTT GGGCAAAATT GCAAGACAAA TTAGACATTC TTATCCTAAA TCCTAATGAT 600 

GTTAAAATCT CTCTCAATGA AGAAGAGATG AAAAAATATT ATGAAAACCA TAGAAAGGAT 660 

TTTAAAAAGC CCACAAGCTT TAAAACACGC TCTTTATATT TTGACGCTAG TTTAGAAAAA 72 0 

ACTGATTTGA AAGAGTTGGA GGAATACTAC CATAAAAACA AGGTGTCTTA TTTGGACAAA 780 

GAGGGGAAAT TACAGGATTT TAAAAGCGTT CAAGAGCAAG TCAAGCATGA TTTAAACATG 84 0 

CAAAAGGCGA ATGAAAAAGC CTTAAGGAGC TATATCGCTC TAAAAAAGGG GAACGCACAA 900 

AACTACACCA CGCAAGATTT TGAAAAAAAC AACTCCCCCT ATACTGCTGA AATCACGCAA 960 

AAACTCACCG CTCTCAAGCC CCTTGAAGTC CTAAAACCAG AGCCTTTTAA AGATGGTTTT 1020 

ATCGTGGTGC AGCTTGTCTC TCAAATTAAA GACGAATTGC AAAATTTTGA TGAAGCCAAA 1080 

AGCGCTCTTA AAACCCGTCT GACTCAAGAA AAAACCCTTA TGGCGTTGCA AACTTTAGCT 114 0 

AAAGAAAAGC TTAAGGATTT TAAAGGGAAA AGCGTGGGTT ATGTAAGCCC TAATTTTGGA 12 00 

GGCACTATCA GTGAACTTAA CCAAGAAGAG AGCGCGAAGT TTATCAACAC CCTTTTTAAC 1260 

CGCCAGGAAA AAAAAGGGTT TGTAACCATA GGTAATAAAG TGGTGCTTTA TCAAATCACA 132 0 

GAGCAAAATT TCAATCACCC CTTTAGTGCA GAAGAAAACC AATACATGCA GCGTTTAGTC 138 0 

AATAACACTA AAACGGATTT TTTTGATAAA GCGTTGATAG AAGAATTGAA AAAACGCTAT 144 0 

AAGATAGTCA AATACATTCA ATAA 1464 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: circular 

(ii) MOLECUIiE TYPE: DNA (genomic) 

5 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .429 



15 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



429 



ATGAAAACGA ACTTTTATAA AATTAAATTA CTATTTGCTT GGTGTCTTAT CATTGGCATG 60 

TTTAACGCTC CGCTTAACGC TGACCAAAAC ACGGATATAA AAGATATTAG TCCTGAAGAT 120 

20 ATGGCGCTAA ATAGCGTGGG GCTTGTTTCT AGAGATCAGC TAAAAATAGA GATCCCTAAA 180 

GAAACCCTAG AGCAAAAAGT GGCCATACTC AATGACTATA ATGATAAGAA TGTTAATATC 24 0 

AAGTTTGACG ACATAAGTTT AGGGAGTTTC CAACCTAATG ATAATCTAGG TATCAATGCG 3 00 

ATGTGGGGCA TTCAAAATCT TCTCATGAGC CAAATGATGA GCAATTACGG TCCAAACAAT 360 

TCTTTCATGT ATGGCTATGC GCCAACATAC TCAGATTCAT CGTTTTTACC ACCGATCTTA 420 

25 GGGTATTAA 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHJ^RACTERISTICS : 
30 (A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : c i r cular 

35 (ii) MOLECULE TYPE: DNA (genomic) 

( iii ) HYPOTHETICAL : NO 
( iv) ANTI - SENSE : NO 



( ix ) FEATURE : 
45 (A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .627 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

50 TTGATCAACA ATAATAATAA CAATAAAAAA CTGAGAGGCT TTTTTTTGTUV AGTTCTCTTA 60 

AGTCTCGTTG TTTTCAGTTC GTATGGGTCA GCAAATGACG ATAAAGAAGC CAAAAAAGAA 120 

GCGCTAGAAA AAGAAAAAAA CACTCCCAAT GGGCTTGTTT ATACGAATTT AGATTTTGAT 180 

AGTTTTAAAG CGACTATCAA AAATTTGAAA GACAAGAAAG TAACTTTCAA AGAAGTCAAT 24 0 

CCCGATATTA TCAAAGATGA AGTTTTTGAC TTCGTGATTG TCAATAGAGT CCTTAAAAAA 3 00 

55 ATAAAGGATT TGAAGCATTA CGATCCAGTT ATTGAAAAAA TCTTTGATGA AAAGGGTAAA 36 0 
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GAAATGGGAT TGAATGTAGA ATTACAGATC AATCCTGAAG TGAAAGACTT TTTTACTTTC 420 

AAAAGCATCA GCACGACCAA CAAACAACGC TGCTTTCTAT CATTGCACGG AGAAACAAGA 4 80 

GAAATTTTAT GCGATGATAA GCTATATAAT GTTTTATTGG CCGTATTCAA TTCTTATGAT 54 0 

CCTAATGATC TTTTG7VAACA CATTAGCACC ATAGAGTCTC TCAAAAAAAT CTTTTATACG 600 

ATTACATGTG AAGCGGTATA TCTATAA 627 



(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1- . .738 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



ATGGCAGGCA CACAAGCTAT ATATGAATCA TCTTCTGCAG GATTCTTATC GCAAGTCTCC 60 

TCAATCATCT CAAGCACAAG TGGTGTCGCA GGGCCATTTG CAGGAATAGT AGCGGGCGCT 12 0 

ATGACAGCAG CGATTATTCC TATTGTTGTG GGATTTACTA ATCCGCAAAT GACCGCTATC 180 

ATGACCCAAT ACAATCAAAG CATCGCTGAA GCTGTAAGCG TGCCTATGAA AGCCGCTAAC 24 0 

CAACAATACA ACCAATTGTA TCAAGGTTTT AACGATCAAA GCATGGCTGT GGGGAACAAT 300 

ATCTTAAATA TCAGCAAATT AACAGGGGAA TTTAACGCGC AAGGCAACAC GCAAAGCGCG 36 0 

CAAATTAGTG CTGTCAATAG TCAGATTGCA AGCATTTTAG CGAGTAACAC TACCCCTAAA 420 

AATCCTAGCG CTATTGAAGC TTATGCGACG AATCAAATCG CTGTTCCTAG CGTGCCAACA 480 

ACGGTTGAAA TGATGAGCGG TATATTAGGC AATATTACAA GCGCAGCACC AAAATACGCC 540 

CTAGCTCTAC AAGAGCAACT GCGTTCTCAA GCAAGCAACA GCTCAATGAA TGATACAGCC 600 

GATTCCCTTG ATAGCTGTAC CGCTTTAGGC GCACTTGTTG GCTCATCAAA AGTGTTTTTC 660 

AGTTGCATGC AAATTTCTAT GACTCCTATG AGTGTTTCTA TGCCCACTGT TATGCCAAAT 720 

ACCAGCGGTT GCCACTAA 738 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...1104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

ATGATTAAAA GCGTAGAGAT TGAAAATTAC AAAAATTTTG AGCACCTTAA AATGGAAAAT 60 

TTTAAACTCA TCAACTTTTT TACCGGTCAA AACGATGCGG GTAAAACCAA TCTTTTAGAA 120 

GCTCTTTATA CCAACACAGG CCTTTGTGAT CCTACTGCCA ATCAAGTCAG TCTTCCTCCT 180 

GAACATGCCG TGAATATTAG TGAATTCAGA AAAATCAAAC TCGATGCCGA CAACCTAAAA 240 

ACCTTTTTTT ATCAAGGAAA CACCGCTAAT CCCATTAGTA TCCGCACTGA ATTTGAACAT 3 00 

GCTACTATCC CTCTTACTAT CCAATACCCC ACACAAACCA GTTACAGCAA AGACATCAAT 3 60 

TTGAATAGCG ATGATGCTCA TATGACAAAC CTTATAAACA CAACAATAAC GAAGCCACAG 420 

CTCCAATTTT CCTACAATCC ATCCCTTTCC CCCATGACAA TGACTTATGA ATTTGAAAGG 480 

CAAAACCTAG GTTTAATCCA TTCTAATTTA GATAAAATCG CTCAAACCTA TAAAGAAAAT 54 0 

GCGATGTTTA TTCCTATAGA ATTATCTATT GTTAATTCTC TTAAAGCATT GGAAAATTTA 600 

CAATTAGCAA GCAAAGAAAA AGAATTGATT GAAATCCTAC AATGTTTCAA CCCTAATATT 66 0 

TTAAATGCTA ATACAATAAG AAAGTCTGTC TATATCCAAA TCAAAGATGA AAACACACCG 720 

CTAGAAGAAA GTCCCAAAAG GCTTTTAAAT TTGTTTGGTT GGGGTTTTAT CAAATTCTTT 780 

ATTATGGTGA GCATTCTTAT AGACAATCGT GTCAAGTATC TTTTTATTGA TGJVAATAGAA 840 

AGCGGTTTGC ACCATACAAA AATGCAAGAG TTTTTAAAAG CTCTGTTTAA GTTAGCTCAA 900 

AAATTACAGA TTCAAATTTT TGCCACCACG CACAATAAGG TUVTTTTTATT AAACGCCATC 960 

AACACGATAT CCGATAATGA AACGGGAGTT TTTAAAGACA TAGCCTTGTT TGAGCTTGAA 1020 

AAAGAAAGCG CTTCTGGCTT TATCAGACAC AGCTATTCTA TGCTAGAAAA AGCGCTTTAT 1080 
AGGGGTATGG AGGTTAGAGG CTGA 



1104 



(2) INFORMATION FOR SEQ ID NO: 70: 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1230 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1230 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 



ATGTCCTTGA TTAGAGTGAA TGGGGAAGCT TTTAAACTCT CTTTGGAAAG TTTAGAAGAA 



60 



wo 98/18323 



PCT/US97/19575 



- 141 - 

GATCCTTTTG AAACTAAAGA AACGCTAGAA ACGCTAGAAA CGCTTATCAA ACAAACGAGC 12 0 

GTTGTTTTAT TGGCCGCTGG GGAGTCTAAG CGTTTTTCTC GTGCGATTAA AAAGCAGTGG 180 

CTACGCTCTC ACCACACCCC CTTATGGCTC AGCGTGTATG AAAGCTTTAA AGAAGCCCTA 240 

GACTTTAAGG AAGTCATTCT AGTTGTAAGC GAATTGGATT ATGTTTATAT CCAACGCCAT 300 

TACCCCAAAA TCAAGCTTGT AAAAGGCGGG GCATCAAGGC AAGAATCCGT GCGTAACGCT 360 

TTGAAAGTAA TTGATAGCAC TTACACGATC ACCAGCGATG TGGCTAGGGG TTTAGCGAAT 420 

ATGGAAGCGC TTAAAAGCTT GTTTTTAACC CTCCAACAAA CGAGCCATTA TTGCATCGCC 480 

CCTTACTTGC CTTGCTATGA CACAGCGATC TATTATAACG AGGCTTTAGA TAGAGAAGCG 54 0 

ATCAAACTCA TTCAAACCCC GCAATTAAGC CACACCAAAA CGCTCCAATC AGCCCTAAAC 600 

CAAGGGGGTT TTAAAGATGA AAGCAGCGCG ATTTTACAAG CTTTCCCTAA CTCTGTGAGC 660 

TATATTGAAG GCAGTAAGGA TTTGCACAAA CTCACCACAA GCGGCGATTT AAAGTTTTTT 720 

ACGCCTTTTT TTAACCCAGC AAAGGACACT TTTATAGGCA TGGGTTTTGA TACGCATGCG 780 

TTCATTAAAG ATAAGCCTAT GGTTTTAGGG GGGGTTGTTT TGGATTGCGA GTTTGGGTTA 840 

AAGGCTCATA GCGATGGCGA TGCTTTATTG CATGCGGTTA TTGATGCGAT TTTAGGAGCG 900 

ATTAAAGGGG GGGATATTGG CGAATGGTTC CCTGATAATG ACCCCAAATA CAAAAACGCC 960 

-TGT-TGTAAAG -AGGTTTTAAA— AATCGTGTTG GATTTTTCTC"AAAGCATTGG GTTTGAATTG 1020 

CTTGAAATGG GAGCGACCAT CTTTAGCGAA ATCCCTAAAA TCACTCCTTA CAAACCGGCG 1080 

ATTTTAGAGA ATTTGAGCCA ACTTTTGGGT TTAGAAAAAT CTCAAATCAG CTTGAAA6CC 114 0 

ACTACAATGG AAAAAATGGG GTTCATTGGC AAACAAGAAG GGCTGTTAGT CCAAGCGCAT 12 00 

GTGAGCATGC GTTATAAACA AAAACTTTAA 123 0 

(2) INFORMATION FOR SEQ ID NO: 71; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION l..,813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

ATGAAAAAGT TTGTAGCTTT AGGGCTTCTA TCCGCGGTTT TAAGCTCTTC GTTGTTAGCC 60 

GAAGGTGATG GTGTTTATAT AGGGACTAAT TATCAGCTTG GACAAGCCCG TTTGAATAGC 120 

AATATTTATA A,TACAGGGGA TTGCACAGGG AGTGTTGTAG GTTGCCCCCC AGGTCTTACC 180 

GCTAATAAGC ATAATCCAGG AGGCACCAAT ATCAATTGGC ACTCCAAATA CGCTAATGGG 240 

GCTTTGAATG GTTTTGGGTT GAATGTGGGT TATAAGAAAT TCTTCCAATT CAAGTCGCTA 300 

GATATGACAA GCAAGTGGTT TGGTTTTAGA GTGTATGGGC TTTTTGATTA CGGGCATGCC 360 

GATTTAGGTA AACAAGTTTA TGCACCTAAT AAAATCCAGT TGGATATGGT CTCTTGGGGT 420 

GTGGGGAGCG ATTTGTTAGC TGATATTATT GATAAAGACA ACGCTTCTTT TGGTATTTTT 480 

GGTGGGGTCG CTATCGGCGG TAACACTTGG AAAAGCTCTG CAGCAAACTA TTGGAAAGAG 540 

CAAATCATTG AAGCCAAAGG TCCTGATGTT TGTACCCCTA CTTATTGTAA CCCTAATGCC 600 

CCTTATAGCA CCAACACTTC AACCGTCGCT TTTCAAGTGT GGTTGAATTT TGGGGTGAGA 660 
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GCCAATATCT ACAAGCATAA TGGCGTGGAA TTTGGCGTGA GAGTGCCGCT ACTCATCAAT 
AAATTTTTGA GCGCGGGTCC TAACGCTACT AACCTTTATT ACCATTTGAA ACGGGATTAT 
TCGCTTTATT TGGGGTATAA CTACACTTTT TAA 

5 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1317 base pairs 

(B) TYPE: nucleic acid 
IQ (C) STRANDEDNESS : double 

{ D ) TOPOLOGY : c i r cul a r 

(ii) MOLECtJLE TYPE: DNA (genomic) 

15 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Helicobacter pylori 



25 



(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...1317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 



ATGGCTTACA AACCTAACAA AAAGAAGTTA AAAGAATTAA GAGAGCAACC GAATTTATTT 
AGCATCTTAG ATAAGGGCGA TGTTGCAACA AACAATCCTG TTGAAGAGTC AGACAAGGCC 

30 AATAAAATAC AAGAGCCACT CCCTTATGTC GTGAAAACGC AAATCAATAA AGCAAGCATG 
ATTTCTAGAG ATCCTATTGA ATGGGCAAAG TATTTAAGCT TTGAAAAACG AGTCTATAAG 
GATAATAGTA AAGAAGATGT CAATTTCTTT GCCAATGGTG AGATAAAAGA AAGTTCTCGT 
GTTTATGAAG CGAATAAAGA AGGGTTTGAA AGGCGCATCA CTAAAAGATA CGATCTGATT 
GATAGAAATA TTGATAGAAA TAGAGAATTT TTTATAAAAG AAATTGAAAT TCTAACCCAC 

35 ACAAACAGCT TAAAAGAATT GAAAGAGCAA GGGTTAGAAA TCCAATTGAC CCACCATAAT 
GAAACGCATA AGAAAGCCTT AGAAAATGGC AATGAAATCG TTAAAGAATA CGACCATCTT 
AAAGATATTT ACCAAGAAGT AGAAAGAACA AAAGATGGTG GATTGGTAAG AGAAATAATC 
CCCAGTATTT CTAGCGCTGA GTATTTCAAG CTTTACAACA AACTGCCTTT TGAATCAATA 
AACAATGAAA ATACCAAACT GAATACTAAC GACAATGAAG AAGTTAAAAA ACTAGAATTT 

40 GAATTAGCTA AAGAAGTGCA TATTTTAATC CTAGAGCAAC AATTGCTTTC AGCAACAAAT 
TATTATTCTT GGATAGATAA AGATGATAAT GCGAATTTTG CTTGGAAAAT GCATAGGCTT 
ATCAATGAAA ATAAACTCAA AGAAAACCAT CTCAGCGCCA ATAACGCTAA TAAGATTAAG 
CAATTTTTCT TTAATAATGG TTCTATTTTA GGCTGGACTA AAGAAGAACA AAGCGCTATA 
CAAGAAAACA GAGATTATTC TTTAAGAAGC GCTCTTTTAA GTTTAGAAGA AATCGCTCAA 

45 GCAAAAATTG AATTGCAAAA ATACTATGAA AGCGTTTATG TTAATGGTGA TGGGAATAAA 
AGAGAAATCA AGCCTTTTAA AGAAATTTTA AGAGACACCA ACAATTTTGA AAAAGCTTAT 
AAGGAGCGTT ATGACAAATT GGTAAGCTTG AGTGCAGCAA TCATTCAAGC TAAAGAGGGT 
GGTAATGAGC GACCAAATTC TAGTGCAAAT AACAATAACC CTATTAAAAA TACAATAGAG 
ACTAATACTT CTAACAATAT TATTCAAAAT AATGATAATA TAATCATCCA AATTTAA 



720 
780 
813 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1317 



50 



(2) INFORMATION FOR SEQ ID NO: 73: 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...648 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

ATGCAAGCGT TAAAATCATT GCTTGAAGTG ATTACAAAAC TCCAGAATCT AGGCGGCTAT 60 

TTGATGCATA TAGCTATTTT CATCATTTTT ATTTGGATTG GAGGGCTTAA GTTTGTGCCT 120 

TACGAAGCTG AAGGGATCGC CCCTTTTGTG GCCAACTCCC CTTTCTTTTC TTTCATGTAT 18 0 

AAATTTGAAA AACCTGCATA CAAACAACAC AAAATGTCTG AATCCCAATC CATGC7VAGAA 240 

GAAATGCAAG ATAACCCTAA AATCGTTGAA AACAAAGAAT GGCATAAAGA AAACCGCACT 3 00 

TATTTAGTGG CTGAAGGTTT AGGGATTACG ATCATGATCC TAGGCATTTT GGTGCTTTTG 360 

GGGCTTTGGA TGCCTTTAAT GGGCGTAGTT GGGGGCTTGC TTGTCGCTGG AATGACGATC 420 

ACCACCCTAT CTTTTTTATT CACAACGCCA GAAGTGTTTG TCAATCAGCA TTTCCCATGG 480 

CTTTCTGGGG CTGGAAGGCT AGTGGTTAAA GACTTGGCGT TATTTGCTGG AGGCTTGTTT 54 0 

GTGGCCGGAT TTGATGCGAA ACGCTATTTG GAGGGTAAAG GGTTTTGCTT GATGGACCGC 600 

TCATCGGTAG GGATTAAAAC TAAATGCTCT AGCGGGTGTT GCTCTTAA 648 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 6 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Met lie Lys Arg lie Ala Cys lie Leu Ser Leu Ser Ala Ser Leu Ala 

15 10 15 

Leu Ala Gly Glu Val Asn Gly Phe Phe Met Gly Ala Gly Tyr Gin Gin 

20 25 30 

Gly Arg Tyr Gly Pro Tyr Asn Ser Asn Tyr Ser Asp Trp Arg His Gly 
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10 



15 



20 



45 



50 



55 







35 








40 










45 








Asn 


Asp 
50 


Leu 


Tyr 


Gly Leu 


Asn 
55 


Phe 


Lys 


Leu 


Gly 


Phe 
60 


Val 


Gly 


Phe 


Ala 


Asn 


Lys 


Trp 


Phe 


Gly Ala 


Arg 


Val 


Tyr 


Gly 


Phe 


Leu 


Asp 


Trp 


Phe 


Asn 


65 






70 










75 










80 


Thr 


Ser 


Gly 


Thr 


Glu His 


Thr 


Lys 


Thr 


Asn 


Leu 


Leu 


Thr 


Tyr Gly Gly 






85 








90 










95 




Gly 


Gly 


Asp 


Leu 


lie Val 


Asn 


Leu 


He 


Pro 


Leu 


Asp 


Lys 


Phe 


Ala 


Leu 






100 








105 










110 






Gly 


Leu 


lie 


Gly Gly Val 


Gin 


Leu 


Ala 


Gly 


Asn 


Thr 


Trp 


Met 


Phe 


Pro 




115 








120 










125 








Tyr 


Asp 


Val 


Asn 


Gin Thr 


Arg 


Phe 


Gin 


Phe 


Leu 


Trp 


Asn 


Leu 


Gly 


Gly 


130 








13 5 










140 








Phe 


Arg 


Met 


Arg 


Val 


Gly Asp 


Arg 


Ser 


Ala 


Phe 


Glu 


Ala 


Gly 


Val 


Lys 


145 








150 










155 










160 


Pro 


Met 


Val 


Asn 


Gin Gly 
165 


Ser 


Lys 


Asp 


Val 
170 


Gly 


Leu 


He 


Arg 


Tyr 
175 


Tyr 


Ser 


Trp 


Tyr 


Val 
180 


Asp Tyr 


Val 


Phe 


Thr 
185 


Phe 















(2 ) INFORMATION FOR SEQ ID NO : 75 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 
25 (B) TYPE: amino acid 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

35 (ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .116 



40 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 75: 












Leu 


Met 


Arg 


He 


He 


He 


Arg 


Leu 


Leu 


Ser 


Phe 


Lys 


Met 


Asn 


Ala 


Phe 


1 






5 










10 










15 




Leu 


Lys 


Leu 


Ala 


Leu 


Ala 


Ser 


Leu 


Met 


Gly 


Gly 


Leu 


Trp 


Tyr 


Ala 


Phe 






20 










25 










30 






Asn 


Gly 


Glu 


Gly 


Ser 


Glu 


He 


Val 


Ala 


He 


Gly 


He 


Phe 


Val 


Leu 


He 




35 










40 










45 








Leu 


Phe 


Val 


Phe 


Phe 


He 


Arg 


Pro 


Val 


Ser 


Phe 


Gin 


Asp 


Pro 


Glu 


Lys 




50 










55 










60 










Arg 


Glu 


Glu 


Tyr 


He 


Glu 


Arg 


Leu 


Lys 


Lys 


Asn 


His 


Glu 


Arg 


Lys 


Met 


65 










70 










75 










80 


He 


Leu 


Gin 


Asp 


Lys 


Gin 


Lys 


Glu 


Glu 


Gin 


Met 


Arg 


Leu 


Tyr 


Gin 


Ala 








85 










90 










95 




Lys 


Lys 


Glu 


Arg 


Glu 


Ser 


Arg 


Gin 


Lys 


Gin 


Asp 


Leu 


Lys 


Glu 


Gin 


Met 




100 










105 










110 






Lys 


Lys 


Tyr 


Ser 
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115 

(2) INFORMATION FOR SEQ ID NO: 76: 

5 (i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Helicobacter pylori 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1.,.345 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 



Met Val Lys His Tyr Leu Phe Met Ala Val Ser Gin Val Phe Phe Ser 
15 10 15 

25 Phe Phe Leu Val Leu Phe Phe lie Ser Ser lie Val Leu Leu lie Ser 
20 25 30 

lie Ala Ser Val Thr Leu Val lie Lys Val Ser Phe Leu Asp Leu Val 

35 40 45 

Gin Leu Phe Leu Tyr Ser Leu Pro Gly Thr lie Phe Phe lie Leu Pro 
30 50 55 60 

lie Thr Phe Phe Ala Ala Cys Ala Leu Gly Leu Ser Arg Leu Ser Tyr 
65 70 75 80 

Asp His Glu Leu Leu Val Phe Phe Ser Leu Gly Val Ser Pro Lys Lys 
85 90 95 

35 Met Thr Lys Ala Phe Val Pro Leu Ser Leu Leu Val Ser Ala lie Leu 
100 105 110 

Leu Ala Phe Ser Leu lie Leu lie Pro Thr Ser Lys Ser Ala Tyr Tyr 

115 120 125 

Gly Phe Leu Arg Gin Lys Lys Asp Lys lie Asp lie Asn lie Arg Ala 
40 130 135 140 

Gly Glu Phe Gly Gin Lys Leu Gly Asp Trp Leu Val Tyr Val Asp Lys 
145 150 155 160 

Thr Glu Asn Asn Ser Tyr Asp Asn Leu Val Leu Phe Ser Asn Lys Ser 
165 170 175 

45 Leu Ser Gin Glu Ser Phe lie Leu Ala Gin Lys Gly Asn lie Asn Asn 
180 185 190 

Gin Asn Gly Val Phe Glu Leu Asn Leu Tyr Asn Gly His Ala Tyr Phe 

195 200 205 

Thr Gin Gly Asp Lys Met Arg Lys Val Asp Phe Glu Glu Leu His Leu 
50 210 215 220 

Arg Asn Lys Leu Lys Ser Phe Asn Ser Asn Asp Ala Ala Tyr Leu Gin 
225 230 235 240 

Gly Thr Asp Tyr Leu Gly Tyr Trp Lys Lys Ala Phe Gly Lys Asn Ala 
245 250 255 

55 Asn Lys Asn Gin Lys Arg Arg Phe Ser Gin Ala lie Leu Val Ser Leu 
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260 

Phe Pro Leu Ala 
275 

Pro Arg Phe Lys 
5 290 

Gly Val Tyr Ptie 
305 

Met Thr Phe Phe 

10 Phe Arg Lys Phe 
340 



Ser Val Phe Leu 
280 

Thr Asn Trp Ser 
295 

Leu Met Val His 
310 

Phe Pro Phe lie 
325 

lie Leu Lys Arg 
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265 

He Pro Leu Phe 

Tyr Phe Tyr Val 
300 

Val He Ser Thr 
315 

Trp Ala Phe He 
330 

Tyr 
345 



270 

Gly He Ala Asn 
285 

Leu Gly Ala Val 

Asp Leu Phe Leu 
320 

Ser Tyr Leu Leu 
335 



(2) INFORMATION FOR SEQ ID NO: 77: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 76 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1. , .276 



30 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 



Met 


Lys 


Lys 


Lys 


Ala 


Lys 


Val 


Phe 


Trp 


Cys 


Cys 


Phe 


Lys 


Met 


He 


1 




5 










10 










15 


Trp 


Leu 


Tyr 


Leu 


Ala 


Val 


Phe 


Phe 


Leu 


Leu 


Ser 


Val 


Ser 


Asp 


Ala 




20 










25 










30 




Glu 


He 


Ala 
35 


Met 


Gin 


Arg 


Phe 


Asp 
40 


Lys 


Gin 


Asn 


His 


Lys 
45 


He 


Phe 


He 


Leu 


Ala 


Asp 


Lys 


Val 


Ser 


Ala 


Lys 


Asp 


Asn 


Val 


He 


Thr 


Ala 




50 










55 










60 




Ala 




Gly 


Asn 


Ala 


He 


Leu 


Leu 


Asn 


Tyr 


Asp 


Val 


Tyr 


He 


Leu 


Asp 


65 










70 










75 






Gly 




Val 


Arg 


Tyr 


Asp 


Thr 


Lys 


Thr 


Lys 


Glu 


Ala 


Leu 


Leu 


Glu 


Asn 








85 










90 










95 


Lys 


Val 


Tyr 


Arg 


Gly 


Glu 


Gly 


Leu 


Leu 


Val 


Lys 


Thr 


Asp 


Tyr 


Val 






100 










105 










110 




Leu 


Ser 


Leu 


Asn 


Glu 


Lys 


Tyr 


Glu 


He 


He 


Phe 


Pro 


Phe 


Tyr 


Val 






115 










120 










125 




Gly 


Asp 


Ser 


Val 


Ser 


Gly 


He 


Trp 


Val 


Ser 


Ala 


Asp 


He 


Ala 


Ser 


130 










135 










14 0 








Asp 


Gin 


Lys 


Tyr 


Lys 


He 


Lys 


Asn 


Met 


Ser 


Ala 


Ser 


Gly 


Cys 


Ser 


145 










150 










155 










Asp 


Asn 


Pro 


He 


Trp 


His 


Val 


Asn 


Ala 


Thr 


Ser 


Gly 


Ser 


Phe 


Asn 








165 










170 










175 


Gin 


Lys 


Ser 


His 


Leu 


Ser 


Met 


Trp 


Asn 


Pro 


Lys 


He 


Tyr 


Val 


Gly 



80 



160 
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180 

lie Pro Val Leu 
195 

Arg Thr Thr Gly 
5 210 

Gly Phe lie Tyr 
225 

Asp Met Thr Phe 

10 Asn Phe Glu Ala 
260 

Cys Ala Leu Phe 
275 



Tyr Leu Pro Tyr 
200 

Phe Leu Tyr Pro 
215 

Leu Gin Pro Phe 
230 

Thr Pro Gin lie 
245 

Arg Tyr lie Asn 



185 

lie Phe Met Ser 

Glu Phe Gly Thr 
220 

Tyr Leu Ala Pro 
235 

Arg Tyr Lys Arg 
250 

Ser Lys Thr Gin 
265 



190 

Thr Ser Asn Lys 
205 

Ser Asn Leu Asp 

Lys Asn Ser Trp 
24 0 

Gly Phe Gly Leu 
255 

Val Phe lie Gin 
270 



15 (2) INFORMATION FOR SEQ ID NO: 78 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
30 (A) NAME/KEY: niisc_f eature 

(B) LOCATION 1...224 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



35 Met lie Arg 

1 ■ 

Ala Gly Val 

Leu Ser Asp 
40 35 
Glu Asp Thr 
50 

Leu Asn Ser 
65 

45 Glu Lys lie 

Asn Leu lie 

Lys lie Lys 
50 115 
Met Ala Phe 

130 
Gin Leu Gin 
145 

55 Phe lie Asp 



Leu Lys Gly Leu 
5 

Leu Leu Gly Ala 
20 

Glu Asp Leu Leu 

Leu Thr Ser Cys 
55 

Gly Trp Asn Leu 
70 

Phe Glu Cys Val 
85 

Asn Lys Glu Asp 
100 

Glu lie Lys Glu 

Glu Met Lys Glu 
135 

Thr Met Leu Glu 
150 

Asp Trp His Glu 



Asn Lys Thr Leu Lys 
10 

Thr Ala Pro Leu Met 
25 

Lys Arg Val Lys Leu 
40 

Asn Ala Lys Val Asp 
60 

Ser Lys Glu Phe Pro 
75 

Glu Glu Glu Lys His 
90 

Thr Lys Asp Lys Glu 
105 

Lys Ala Lys Val Leu 
120 

His Ser Lys Glu Phe 
140 

Asn Ala Phe Asp Asn 
155 

Arg Phe Gly Gly lie 



Thr Ser Leu Leu 
15 

Ala Lys Pro Leu 
30 

His Asn lie Lys 
45 

Gly Ser Gin Tyr 

Gin Glu Tyr Arg 
80 

Lys Gin Ala Leu 
95 

Glu Leu Ala Lys 
110 

Arg Gin Lys Phe 
125 

Pro Asn Lys Lys 

Gly Ala Glu Ser 
160 

Ser Arg Glu Asn 
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165 170 175 

Thr Tyr Lys Ala Leu Gly lie Lys Glu Tyr Ser Asp Glu Gly Lys lie 

180 185 190 

Leu Pro Leu Ala Lys Glu Val lie Leu Asp Asn lie Lys Lys lie Leu 

195 200 205 

Lys Lys Ala Leu Met lie Leu Asp Asn Pro Tyr Leu Leu Trp Leu Val 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 
25 (B) LOCATION 1...429 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



30 



10 



15 



35 



40 



45 



50 



55 



Met 


Pro 


Tyr 


Ala 


Leu 


Arg 


Lys 


Arg 


Phe 


Phe 


Lys 


Arg 


Leu 


Leu 


Leu 


Phe 


1 






5 










10 










15 


Phe 


Phe 


Leu 


He 


Val 


Cys 


Met 


He 


Asn 


Leu 


His 


Ala 


Lys 


Ser 


Tyr 


Leu 








20 










25 










30 






Ser 


Pro 


Leu 


Pro 


Pro 


Ala 


His 


Gin 


Gin 


He 


He 


Lys 


Thr 


Glu 


Pro 


Cys 






35 










40 










45 








Ser 


Leu 


Glu 


Cys 


Leu 


Lys 


Asp 


Leu 


Met 


Leu 


Gin 


Asn 


Gin 


He 


Phe 


Ser 




50 










55 










60 








Thr 


Phe 


val 


Ser 


Gin 


Tyr 


Asp 


Asp 


Asn 


Asn 


Gin 


Asp 


Glu 


Ser 


Leu 


Lys 


65 










70 










75 










80 


Tyr 


Tyr 


Lys 


Asp 


He 


Leu 


Asn 


Lys 


Leu 


Asn 


Pro 


Val 


Phe 


He 


Ala 


Ser 








85 










90 










95 




Gin 


Thr 


Pro 


Ala 


Lys 


Glu 


Ser 


Tyr 


Glu 


Pro 


Lys 


He 


Glu 


Leu 


Ala 


He 








100 










105 










110 






Leu 


Leu 


Pro 


Lys 


Lys 


Val 


Val 


Gly 


Arg 


Tyr 


Ala 


He 


Leu 


Val 


Met 


Asn 






115 










120 










125 








Thr 


Leu 


Leu 


Ala 


Tyr 


Leu 


Asn 


Thr 


Arg 


Asn 


Asn 


Asp 


Phe 


Asn 


He 


Gin 




130 










i35 










140 










Val 


Phe 


Asp 


Ser 


Asp 


Glu 


Glu 


Ser 


Pro 


Glu 


Lys 


Leu 


Glu 


Glu 


Thr 


Tyr 


145 










150 










155 










160 


Lys 


Glu 


He 


Glu 


Lys 


Glu 


Lys 


Phe 


Pro 


Phe 


He 


He 


Ala 


Leu 


Leu 


Thr 








165 










170 










175 




Lys 


Glu 


Gly 


Val 


Glu 


Asn 


Leu 


Leu 


Gin 


Asn 


Thr 


Thr 


He 


Asn 


Thr 


Pro 






180 










185 










190 






Thr 


Tyr 


Val 


Pro 


Thr 


Val 


Asn 


Lys 


Thr 


Gin 


Leu 


Glu 


Asn 


His 


Thr 


Glu 




195 










200 










205 








Leu 


Ser 


Leu 


Ser 


Glu 


Arg 


Leu 


Tyr 


Phe 


Gly 


Gly 


He 


Asp 


Tyr 


Lys 


Glu 
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210 215 220 

Gin Leu Gly Met Leu Ala Thr Phe lie Ser Pro Asn Ser Pro Val lie 
225 230 235 240 

Glu Tyr Asp Asp Asp Gly Leu He Gly Glu Arg Leu Arg Gin He Thr 
5 245 250 255 

Glu Ser Leu Asn Val Glu Val Lys His Gin Glu Asn He Ser Tyr Lys 

260 265 270 

Gin Ala Thr Ser Phe Ser Lys Asn Phe Arg Lys His Asp Ala Phe Phe 
275 280 285 

10 Lys Asn Ser Thr Leu lie Leu Asn Thr Pro Thr Thr Lys Ser Gly Leu 
290 295 300 

He Leu Ser Gin He Gly Leu Leu Glu Tyr Lys Pro Leu Lys He Leu 
305 310 315 320 

Ser Thr Gin He Asn Phe Asn Pro Ser Leu Leu Leu Leu Thr Gin Pro 
15 325 330 335 



30 



35 



Lys Asp^Arg^ Lys""Asn Leu/ Phe^ Ile Val Asn Ala Leu Gin Asn Ser Asp 

340 345 350 

Glu Thr Leu He Glu Tyr Ala Ser Leu Leu Glu Ser Asp Leu Arg His 
355 360 365 

20 Asp Trp Val Asn Tyr Ser Ser Ala He Gly Leu Glu Met Phe Leu Asn 
370 375 380 

Thr Leu Asp Pro His Phe Lys Lys Ser Phe Gin Glu Ser Leu Glu Asp 
385 390 395 400 

Asn Gin Val Arg Tyr His Asn Gin He Tyr Gin Ala Leu Gly Tyr Ser 
25 405 410 415 

Phe Glu Pro He Lys Asn Glu Ser Glu Thr Lys Lys Glu 
420 425 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
45 (B) LOCATION 1. . .455 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Val Leu Lys Phe Gin Lys Leu Pro Leu Leu Phe Val Ser He Leu Tyr 
50 1 5 10 15 

Asn Gin Ser Pro Leu Leu Ala Phe Asp Tyr Lys Phe Ser Gly Val Ala 

20 25 30 

Glu Ser Val Ser Lys Val Gly Phe Asn His Ser Lys Leu Asn Ser Lys 
35 40 45 

55 Glu Gly He Phe Pro Thr Ala Thr Phe Val Thr Ala Thr He Lys Leu 
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50 

Gin Val Asp 
65 

Lys lie Gly 

5 

Lys Thr Leu 

Phe Tyr Leu 
115 

10 Lys Asp Ser 
130 

Tyr Asn Ser 
145 

Leu Gly Arg 

15 

Gly Phe Glu 

Phe Ser Ser 
195 

20 Trp Tyr Ala 
210 
Gly lie His 
225 

Met Pro Phe 

25 

Lys He His 

Ala Gin Thr 
275 

30 Tyr Asp Val 
290 
Leu Leu He 
305 

Gly Tyr Tyr 

35 

Gly Asn Pro 

Val Phe Ser 
355 

40 Gly Gly Gly 
370 
Thr Tyr Ala 
385 

Tyr Lys Trp 

45 

Val Val Ser 

Phe Asn Lys 
435 

50 Val Ser Met 
450 



55 

Ser Asn Leu Leu 
70 

Val Gly Gly He 
85 

He Asp Gin Ala 
100 

He Gly Arg Trp 

Leu He Glu Ser 
135 

Tyr Leu Phe Tyr 
150 

Tyr Leu Ser Asn 
165 

Leu Asp Tyr Lys 
180 

Phe Gly Arg Ala 

Pro He Val Thr 
215 

Ala Ala Gin Leu 
230 

Ala Tyr Phe Ser 
245 

He Asp Ser Asn 
260 

Thr He Asn Val 

Tyr Trp Arg Asn 
295 

His Gin Arg Phe 
310 

Gin Asn Phe Gly 
325 

He Pro Phe Asn 
340 

Asn Ala He Thr 

Val Tyr Arg Gly 
375 

Thr Arg Ala Ser 
390 

Gly Ser Phe Ala 
405 

Met His Asn Gly 
420 

Ala Phe Lys Ala 

Lys Phe Phe Phe 
455 



-150- 

60 

Pro Lys Asn He Glu 
75 

Leu Gly Ala Leu Ala 
90 

Thr His Gin He Tyr 
105 

Trp Gly Phe Leu Gly 
120 

Asp Ala His Thr Arg 
140 

Ser Tyr Gly Asp Lys 
155 

Met Asp Phe Met Ser 
170 

He Asn Ser Lys He 
185 

Leu Ala Phe Gly Gin 
200 

Glu Asp Gly Arg Lys 
220 

Tyr Phe Ser Ser Lys 
235 

Pro Lys He Tyr Gly 
250 

Pro Lys Phe Lys Gly 
265 

He Phe Pro Val Tyr 
280 

Ser Lys He Gly Glu 
300 

Asp Tyr Asn Glu Phe 
315 

Asn Ala Asn Ala Arg 
330 

Tyr Arg Asn Asn Ser 
345 

Ala Asp Ala Val Ser 
360 

Phe Leu Trp Gly He 
380 

Glu Arg Ser He Asn 
395 

Arg Val Asp Val Asn 
410 

Tyr Arg Leu Asp Tyr 
425 

Asp Ala Gin Asp Arg 
440 



Lys His Ser Leu 
80 

Tyr Asp Ser Thr 
95 

Gly Ser Glu Leu 
110 

Asn Ala Pro Trp 
125 

Asn Tyr Val Leu 

Phe His Leu Lys 
160 

Ser Tyr Thr Gin 
175 

Ala Leu Lys Trp 
190 

Trp He Arg Asp 
205 

Glu Val Tyr Asp 

His Val Gin Val 
240 

Ala Pro Gly Val 
255 

Leu Gly Leu Arg 
270 

Ala Lys Asp Leu 
285 

Trp Gly Ala Ser 

Asn Phe Gly Phe 
320 

He Gly Trp Tyr 
335 

Val Tyr Gly Gly 
350 

Gly Tyr Val Phe 
365 

Leu Gly Arg Tyr 

Leu Asn Leu Gly 
400 

Leu Glu Tyr Tyr 
415 

Leu Thr Gly Pro 
430 

Ser Asn Leu Met 
445 



(2) INFORMATION FOR SEQ ID NO: 81: 



55 



(i) SEQUENCE CHARACTERISTICS: 



wo 98/18323 



PCT/US97/19575 



- 151 - 

(A) LENGTH: 282 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
0 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .282 



5 



(xi) SEQUENCE" DESCRIPTION: SEQ ID NO: Si- 



Met Gly Cys Ser Phe lie Phe Lys Lys Val Arg Val Tyr Ser Lys Met 
1 5 10 15 

20 Leu Val Ala Leu Gly Leu Ser Ser Val Leu lie Gly Cys Ala Met Asn 
20 25 30 

Pro Ser Ala Glu Thr Lys Lys Pro Asn Asp Ala Lys Asn Gin Gin Pro 

35 40 45 

Val Gin Thr His Glu Arg Met Thr Thr Ser Ser Glu His Val Thr Pro 
25 50 55 60 

Leu Asp Phe Asn Tyr Pro Val His lie Val Gin Ala Pro Gin Asn His 
65 70 75 80 

His Val Val Gly lie Leu Met Pro Arg lie Gin Val Ser Asp Asn Leu 
85 90 95 

30 Lys Pro Tyr lie Asp Lys Phe Gin Asp Ala Leu lie Asn Gin lie Gin 
100 105 110 

Thr lie Phe Glu Lys Arg Gly Tyr Gin Val Leu Arg Phe Gin Asp Glu 

115 120 125 

Lys Ala Leu Asn Val Gin Asp Lys Lys Lys lie Phe Ser Val Leu Asp 
35 130 135 140 

Leu Lys Gly Trp Val Gly lie Leu Glu Asp Leu Lys Met Asn Leu Lys 
145 150 155 160 

Asp Pro Asn Ser Pro Asn Leu Asp Thr Leu Val Asp Gin Ser Ser Gly 
165 170 175 

40 Ser Val Trp Phe Asn Phe Tyr Glu Pro Glu Ser Asn Arg Val Val His 
180 185 190 

Asp Phe Ala Val Glu Val Gly Thr Phe Gin Ala lie Thr Tyr Thr Tyr 

195 200 205 

Thr Ser Thr Asn Asn Ala Ser Gly Gly Phe Asn Ser Ser Lys Ser Val 

210 215 220 

lie His Glu Asn Leu Asp Lys Asn Arg Glu Asp Ala lie His Lys lie 
225 230 235 240 

Leu Asn Arg Met Tyr Ala Val Val Met Lys Lys Ala Val Thr Glu Leu 

245 250 255 

Thr Lys Glu Asn lie Ala Lys Tyr Arg Asp Ala lie Asp Arg Met Lys 

260 265 270 

Gly Phe Lys Ser Ser Met Pro Gin Lys Lys 
275 280 



(2) INFORMATION FOR SEQ ID NO: 82: 
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10 



25 



30 



35 



40 



45 



50 



55 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
15 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...280 



20 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NC 


: 82 : 












Met 


Lvs 


Leu 


Arg 


Ala 


Ser 


Val 


Leu 


j.xe 


Gly 


Val 


Ala 


He 


Leu 


Cys 


Leu 


1 






5 










10 










15 




lie 


Leu 


Ser 


Ala 


Cys 


Ser 


Asn 


Tyr 




Lys 


Lys 


Val 


Val 


Lys 


Gin 


Lys 








20 










25 










30 






Asn 


His 


Val 


Tyr 


Thr 


Pro 


Val 


Tyr 


Asn 


Glu 


Leu 


He 


Glu 


Lys 


Tyr 


Ser 






35 




















45 






Gin 


Glu 


He 


Pro 


Leu 


Asn 


Asp 


Lys 




Lys 


Asp 


Thr 


Pro 


Phe 


Met 


Val 




50 










55 










60 








Val 


Val 


Lys 


Leu 


Pro 


Asn 


Tyr 


Lys 


Asp 


Tyr 


Leu 


Leu 


Asp 


Asn 


Lys 


Gin 


65 








70 










75 










80 


Val 


Leu 


Thr 


Phe 


Lys 


Leu 


val 


His 


His 


Ser 


Lys 


Lys 


He 


Thr 


Leu 


He 










85 










90 










95 




Gly 


Asp 


Ala 


Asn 


Lys 


He 


Leu 


Gin 


Tyr 


Lys 


Asn 


Tyr 


Phe 


Gin 


Ala 


Asn 




100 










105 










110 




Gin 


Gly 


Ala 


Arg 


Ser 


Asp 


He 


Asp 


Phe 


Tyr 


Leu 


Gin 


Pro 


Thr 


Leu 


Asn 




115 










120 










125 








Lys 


Gly 


Val 


Val 


Met 


He 


Ala 


Ser 


Asn 


Tyr 


Asn 


Asp 


Asn 


Pro 


Asn 


Asn 


130 










135 










140 










Lys 


Glu 


Lys 


Pro 


Gin 


Thr 


Phe 


Asp 


Val 


Leu 


Gin 


Gly 


Ser 


Gin 


Pro 


Met 


14 5 








150 










155 










160 


Leu 


Gly 


Ala 


Asn 


Thr 


Lys 


Asn 


Leu 


His 


Gly Tyr 


Asp 


Val 


Ser Gly Ala 








165 










170 










175 




Asn 


Asn 


Lys 


Gin 


Val 


He 


Asn 


Glu 


Val 


Ala 


Arg 


Glu 


Lys 


Ala 


Gin 


Leu 






180 










185 










190 




Gin 


Glu 


Lys 


He 


Asn 


Gin 


Tyr 


Tyr 


Lys 


Thr 


Leu 


Leu 


Gin 


Asp 


Lys 


Glu 




195 










200 










205 








Glu 


Tyr 


Thr 


Thr 


Arg 


Lys 


Asn 


Asn 


Gin 


Arg 


Glu 


He 


Leu 


Glu 


Thr 


Leu 




210 










215 










220 








Glu 


Ser 


Asn 


Arg 


Ala 


Gly 


Tyr 


Gin 


Met 


Arg 


Gin 


Asn 


Val 


He 


Ser 


Ser 


225 










230 










235 










240 


lie 


Phe 


Lys 


Asn 


Gly 


Asn 


Leu 


Asn 


Met 


Gin 


Ala 


Lys 


Glu 


Glu 


Glu 


Val 








245 










250 










255 


Gin 


Arg 


Glu 


Lys 


Leu 


Gin 


Glu 


Glu 


Arg 


Glu 


Asn 


Glu 


Tyr 


Leu 


Arg 


Asn 




260 










265 










270 






He 


Arg 


Ser 


Leu 


Leu 


Ser 


Gly 


Lys 






















275 










280 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

~( IX)' FEATURET " 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1,..393 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Met Arg Lys Leu Phe lie Pro Leu Leu Leu Phe Ser Ala Leu Glu Ala 
15 10 15 

Asn Glu Lys Asn Gly Phe Phe lie Glu Ala Gly Phe Glu Thr Gly Leu 
25 20 25 30 

Leu Glu Gly Thr Gin Thr Gin Glu Lys Arg His Thr Thr Thr Lys Asn 

35 40 45 

Thr Tyr Ala Thr Tyr Asn Tyr Leu Pro Thr Asp Thr lie Leu Lys Arg 
50 55 60 

30 Ala Ala Asn Leu Phe Thr Asn Ala Glu Ala lie Ser Lys Leu Lys Phe 
65 70 75 80 

Ser Ser Leu Ser Pro Val Arg Val Leu Tyr Met Tyr Asn Gly Gin Leu 

85 90 95 

Thr lie Glu Asn Phe Leu Pro Tyr Asn Leu Asn Asn Val Lys Leu Ser 
35 100 105 110 

Phe Thr Asp Ala Gin Gly Asn Val lie Asp Leu Gly Val lie Glu Thr 

115 120 125 

lie Pro Lys His Ser Lys lie Val Leu Pro Gly Glu Ala Phe Asp Ser 
130 135 140 

40 Leu Lys lie Asp Pro Tyr Thr Leu Phe Leu Pro Lys lie Glu Ala Thr 
145 150 155 160 

Ser Thr Ser lie Ser Asp Ala Asn Thr Gin Arg Val Phe Glu Thr Leu 

165 170 175 

Asn Lys lie Lys Thr Asn Leu Val Val Asn Tyr Arg Asn Glu Asn Lys 
45 180 185 190 

Phe Lys Asp His Glu Asn His Trp Glu Ala Phe Thr Pro Gin Thr Ala 

195 200 205 

Glu Glu Phe Thr Asn Leu Met Leu Asn Met lie Ala Val Leu Asp Ser 
210 215 220 

50 Gin Ser Trp Gly Asp Ala lie Leu Asn Ala Pro Phe Glu Phe Thr Asn 
225 230 235 240 

Ser Pro Thr Asp Cys Asp Asn Asp Pro Ser Lys Cys Val Asn Pro Gly 

245 250 255 

Thr Asn Gly Leu Val Asn Ser Lys Val Asp Gin Lys Tyr Val Leu Asn 
55 260 265 270 
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10 



15 



35 



40 



305 



Gin 


Asp 


He 


Val 


Asn 


Lys 


Phe 


Lys 


Asn 


Lys 


Ala 


Asp 


Leu 


Asp 


Val 




275 










280 










285 




He 


Thr 


Val 


Leu 


Lys 


Asp 


Ser 


Gly 


Val 


val 


Gly 


Leu 


Gly 


Ser 


Asp 


290 








295 










300 










Ser 


Asn 


Asn 


Asp 


Asp 


Gly 


Lys 


His 


Tyr 


Gly 


Gin 


Leu 


Gly 


Val 


Val 










310 










315 












Ser 


Ala 


Leu 


Asp 


Pro 


Lys 


Lys 


Leu 


Phe 


Gly 


Asp 


Asn 


Leu 


Lys 


Thr 








325 










330 










335 




Asn 


Leu 


Glu 


Asp 


Leu 


Arg 


Thr 


He 


Leu 


His 


Glu 


Phe 


Ser 


His 


Thr 






340 










345 










350 






Gly 


Tyr 


Gly 


His 


Asn 


Gly 


Asn 


Met 


Thr 


Tyr 


Gin 


Arg 


Val 


Pro 


Val 


355 










360 










365 








Lys 


Asp 


Gly 


Gin 


Val 


Glu 


Lys 


Asp 


Ser 


Asn 


Gly 


Lys 


Pro 


Lys 


Asp 


370 








375 










380 










Asp 


Gly 


Leu 


Pro 


Tyr 


Asn 


Val 


Cys 

















385 390 

(2) INFORMATION FOR SEQ ID NO : 84 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 

{ i i i ) HYPOTHET I CAL : YES 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



45 50 55 



Met 


Lys 


Lys 


Phe 


Val 


1 








5 


Ser 


Leu 


Leu 


Ala 


Glu 








20 




Leu 


Gly 


Gin 


Ala 


Arg 






35 






Thr 


Gly 


Ser 


Val 


Val 




50 








Asn 


Pro 


Gly 


Gly 


Thr 


65 










Ala 


Leu 


Asn 


Gly 


Phe 










85 


Phe 


Lys 


Ser 


Leu 


Asp 








100 




Gly 


Leu 


Phe 


Asp 


Tyr 






115 






Pro 


Asn 


Lys 


He 


Gin 




130 









Gly 


Leu 


Leu 


Ser 


Ala 


Val 


Leu 


Ser 




10 










15 


Gly 


Val 


Tyr 


He 


Gly 


Thr 


Asn 


Tyr 




25 










30 




Ser 


Asn 


He 


Tyr 


Asn 


Thr 


Gly 


Asp 


40 










45 






Pro 


Pro 


Gly 


Leu 


Thr 


Ala 


Asn 


Lys 










60 








Asn 


Trp 


His 


Ser 


Lys 


Tyr 


Ala 


Asn 








75 










Asn 


Val 


Gly 


Tyr 


Lys 


Lys 


Phe 


Phe 






90 










95 


Ser 


Lys 


Trp 


Phe 


Gly 


Phe 


Arg 


Val 




105 










110 




Ala 


Asp 


Leu 


Gly 


Lys 


Gin 


Val 


Tyr 



70 S° 



50 

100 

Glv Leu Phe ASP Tvr Gly His 

120 125 
Met Val Ser Trp Gly Val 
55 130 " 135 140 
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Leu Leu Ala Asp lie lie Asp Lys Asp Asn Ala Ser Phe Gly lie Phe 
145 150 155 160 

Gly Gly Val Ala lie Gly Gly Asn Thr Trp Lys Ser Ser Ala Ala Asn 
165 170 175 

5 Tyr Trp Lys Glu Gin lie He Glu Ala Lys Gly Pro Asp Val Cys Thr 
180 185 190 

Pro Thr Tyr Cys Asn Pro Asn Ala Pro Tyr Ser Thr Asn Thr Ser Thr 

195 200 205 

Val Ala Phe Gin Val Trp Leu Asn Phe Gly Val Arg Ala Asn He Tyr 
10 210 215 220 

Lys His Asn Gly Val Glu Phe Gly Val Arg Val Pro Leu Leu He Asn 
225 230 235 240 

Lys Phe Leu Ser Ala Gly Pro Asn Ala Thr Asn Leu Tyr Tyr His Leu 
245 250 255 

15 Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr Phe _ 

" 265 270 

(2) INFORMATION FOR SEQ ID NO: 85: 

20 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH : 14 0 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

25 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .140 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



Met His Pro lie Met Phe Ala Tyr lie Ala Asn Ala Leu Ala Gin Ala 
15 10 15 

40 Arg Lys lie Asn Gly Thr Leu Cys Met Ala Phe Gin Lys lie Ser Gin 
20 25 30 

Val Lys Glu Leu Gly lie Asp Lys Ala Lys Ser Leu lie Gly Asn Leu 

35 40 45 

Ser Gin Val lie lie Tyr Pro Thr Lys Asp Thr Asp Glu Leu lie Glu 
45 50 55 60 

Cys Gly Val Pro Leu Ser Asp Ser Glu lie Asn Phe Leu His Asn Thr 
65 70 75 80 

Asp Met Arg Ala Arg Gin Val Leu Val Lys Asn lie Val Thr Asn Ala 
85 90 35 

50 Ser Ala Phe lie Glu lie Asp Leu Lys Lys lie Cys Lys Asn Tyr Phe 
100 105 110 

He Phe Leu He Ala Met Leu Val He Glu Lys Ser Ser Met He Leu 

115 120 125 

Lys Lys Gin Thr Lys Lys Leu He Arg Lys Ser He 
55 130 135 140 
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10 



15 



30 



35 



40 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION l..,256 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



25 



Met 


Leu 


Gly 


Ser 


Val 


Lys 


Lys 


Ala 


val 


Phe 


Arg 


Val 


Leu 


Cys 


Leu 


Gly 


1 






5 










10 










15 




Ala 


Leu 


Cys 


Leu 


Cys 


Gly 


Gly 


Leu 


Met 


Ala 


Glu 


Gin 


Asp 


Pro 


Lys 


Glu 








20 










25 










30 






Leu 


lie 


Phe 


Ser 


Gly 


He 


Thr 


He 


Tyr 


Thr 


Asp 


Lys 


Asn 


Phe 


Thr 


Arg 






35 










40 










45 








Ala 


Lys 


Lys 


Tyr 


Phe 


Glu 


Lys 


Ala 


Cys 


Lys 


Ser 


Asn 


Asp 


Ala 


Asp 


Gly 




50 










55 










60 










Cys 


Ala 


lie 


Leu 


Arg 


Glu 


Val 


Tyr 


Ser 


Ser 


Gly 


Lys 


Ala 


He 


Ala 


Arg 


65 










70 










75 










80 


Glu 


Asn 


Ala 


Arg 


Glu 


Ser 


He 


Glu 


Lys 


Ala 


Leu 


Glu 


His 


Thr 


Ala 


Thr 










85 










90 










95 




Ala 


Lys 


Val 


Cys 


Lys 


Leu 


Asn 


Asp 


Ala 


Glu 


Lys 


Cys 


Lys 


Asp 


Leu 


Ala 








10 0 










105 










110 






Glu 


Phe 


Tyr 


Phe 


Asn 


Val 


Asn 


Asp 


Leu 


Lys 


Asn 


Ala 


Leu 


Glu 


Tyr 


Tyr 






115 










120 










125 








Ser 


Lys 


Ser 


Cys 


Lys 


Leu 


Asn 


Asn 


Val 


Glu 


Gly 


Cys 


Met 


Leu 


Ser 


Ala 




130 










135 










14 0 










Thr 


Phe 


Tyr 


Asn 


Asp 


Met 


He 


Lys 


Gly 


Leu 


Lys 


Lys 


Asp 


Lys 


Lys 


Asp 


145 










150 










155 










160 


Leu 


Glu 


Tyr 


Tyr 


Ser 


Lys 


Ala 


Cys 


Glu 


Leu 


Asn 


Asn 


Gly 


Gly 


Gly 


Cys 










165 










170 










175 




Ser 


Lys 


Leu 


Gly 


Gly 


Asp 


Tyr 


Phe 


Phe 


Gly 


Glu 


Gly Val 


Thr 


Lys 


Asp 








180 










185 










190 






Phe 


Lys 


Lys 


Ala 


Phe 


Glu 


Tyr 


Ser 


Ala 


Lys 


Ala 


Cys 


Glu 


Leu 


Asn 


Asp 






195 










200 










205 








Ala 


Lys 


Gly 


Cys 


Tyr 


Ala 


Leu 


Ala 


Ala 


Phe 


Tyr 


Asn 


Glu 


Gly 


Lys 


Gly 




210 










215 










220 










Val 


Ala 


Lys 


Asp 


Glu 


Lys 


Gin 


Thr 


Thr 


Glu 


Asn 


Leu 


Glu 


Lys 


Ser 


Cys 


225 








230 










235 










240 


Lys 


Leu 


Gly 


Leu 


Lys 


Glu 


Ala 


Cys 


Asp 


He 


Leu 


Lys 


Glu 


Gin 


Lys 


Gin 








245 










250 










255 





55 (2) INFORMATION FOR SEQ ID NO: 87: 
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(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



(ii) 



MOLECULE TYPE: protein 



(iii) 



HYPOTHET I CAL : YES 



10 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) 



FEATURE : 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .242 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 87 



20 Met Lys Lys Phe Phe Ser Gin Ser Leu Leu Ala Leu lie lie Ser Met 
15 10 15 

Asn Ala Val Ser Gly Met Asp Gly Asn Gly Val Phe Leu Gly Ala Gly 

20 25 30 

Tyr Leu Gin Gly Gin Ala Gin Met His Ala Asp lie Asn Ser Gin Lys 
25 35 40 45 

Gin Ala Thr Asn Ala Thr lie Lys Gly Phe Asp Ala Leu Leu Gly Tyr 

50 55 €0 

Gin Phe Phe Phe Glu Lys His Phe Gly Leu Arg Leu Tyr Gly Phe Phe 
65 70 75 80 

30 Asp Tyr Ala His Ala Asn Ser lie Lys Leu Lys Asn Pro Asn Tyr Asn 

85 90 95 

Ser Glu Ala Ala Gin Val Ala Ser Gin lie Leu Gly Lys Gin Glu lie 

100 105 110 

Asn Arg Leu Thr Asn lie Ala Asp Pro Arg Thr Phe Glu Pro Asn Met 

115 120 125 

Leu Thr Tyr Gly Gly Ala Met Asp Val Met Val Asn Val lie Asn Asn 

3L30 135 140 

Gly lie Met Ser Leu Gly Ala Phe Gly Gly lie Gin Leu Ala Gly Asn 
145 150 155 160 

Ser Trp Leu Met Ala Thr Pro Ser Phe Glu Gly lie Leu Val Glu Gin 

165 170 175 

Ala Leu Val Ser Lys Lys Ala Thr Ser Phe Gin Phe Leu Phe Asn Val 

180 185 190 

Gly Ala Arg Leu Arg lie Leu Lys His Ser Ser lie Glu Ala Gly Val 

195 200 205 

Lys Phe Pro Met Leu Lys Lys Asn Pro Tyr lie Thr Ala Lys Asn Leu 

210 215 220 

Asp lie Gly Phe Arg Arg Val Tyr Ser Trp Tyr Val Asn Tyr Val Phe 
225 230 235 240 

Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 88: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 67 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...267 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



Met Asn Tyr Pro Asn Leu Pro Asn Ser Ala Leu Glu lie Ser Glu Gin 
1 5 10 15 

20 Pro Glu Val Lys Glu lie Thr Asn Glu Leu Leu Lys Gin Leu Gin Asn 
20 25 30 

Ala Leu Arg Ser Asn Ala His Phe Ser Glu Gin Val Glu Leu Ser Leu 

35 40 45 

Lys Cys He Val Arg He Leu Glu Val Leu Leu Ser Leu Asp Phe Phe 
25 50 55 60 

Lys Asn Ala Asn Glu He Asp Ser Ser Leu Arg Asn Ser He Glu Trp 
65 70 75 80 

Leu Thr Asn Ala Gly Glu Ser Leu Lys Leu Lys Met Lys Glu Tyr Glu 
85 90 95 

30 Arg Phe Phe Ser Glu Phe Asn Thr Ser Met His Ala Asn Glu Gin Glu 
100 105 110 

Val Thr Asn Thr Leu Asn Ala Asn Ala Glu Asn He Lys Ser Glu He 

115 120 125 

Lys Lys Leu Glu Asn Gin Leu He Glu Thr Thr Thr Arg Leu Leu Thr 
35 130 135 140 

ser Tyr Gin He Phe Leu Asn Gin Ala Arg Asp Asn Ala Asn Asn Gin 
145 150 155 160 

He Thr Lys Asn Lys Thr Gin Ser Leu Glu Ala He Thr Gin Ala Lys 
165 170 175 

40 Asn Asn Ala Asn Asn Glu He Ser Asn Asn Gin Thr Gin Ala He Thr 
180 185 190 

Asn He Thr Glu Ala Lys Thr Asn Ala Asn Asn Glu He Ser Asn Asn 

195 200 205 

Gin Thr Gin Ala He Thr Asn He Asn Glu Ala Lys Glu Ser Ala Thr 
45 210 215 220 

Thr Gin He Asn Ala Asn Lys Gin Glu Ala He Asn Asn He Thr Gin 
225 230 235 240 

Glu Lys Thr Gin Ala Thr Ser Glu He Thr Glu Ala Lys Lys Thr Asp 
245 250 255 

50 His Tyr Gin Asn He Asp Phe Phe Glu Phe Glu 
260 265 

(2) INFORMATION FOR SEQ ID NO: 89: 
55 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .544 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Val lie Glu Thr lie Pro Lys His Ser Lys lie Val Leu Pro Gly Glu 
15 10 15 

20 Ala Phe Asp Ser Leu Lys Glu Ala Phe Asp Lys lie Asp Pro Tyr Thr 
20 25 30 

Phe Phe Phe Pro Lys Phe Glu Ala Thr Ser Thr Ser lie Ser Asp Thr 

35 40 45 

Asn Thr Gin Arg Val Phe Glu Thr Leu Asn Asn lie Lys Thr Asn Leu 
25 50 55 60 

lie Met Lys Tyr Ser Asn Glu Asn Pro Asn Asn Phe Asn Thr Cys Pro 
65 70 75 80 

Tyr Asn Asn Asn Gly Asn Thr Lys Asn Asp Cys Trp Gin Asn Phe Thr 
85 90 95 

30 Pro Gin Thr Ala Glu Glu Phe Thr Asn Leu Met Leu Asn Met lie Ala 
100 105 110 

Val Leu Asp Ser Gin Ser Trp Gly Asp Ala lie Leu Asn Ala Pro Phe 

115 120 125 

Glu Phe Thr Asn Ser Ser Thr Asp Cys Asp Ser Asp Pro Ser Lys Cys 
35 130 135 140 

Val Asn Pro Gly Val Asn Gly Arg Val Asp Thr Lys Val Asp Gin Gin 
145 150 155 160 

Tyr lie Leu Asn Lys Gin Gly lie lie Asn Asn Phe Arg Lys Lys lie 
165 170 175 

40 Glu lie Asp Ala Val Val Leu Lys Asn Ser Gly Val Val Gly Leu Ala 
180 185 190 

Asn Gly Tyr Gly Asn Asp Gly Glu Tyr Gly Thr Leu Gly Val Glu Ala 

195 200 205 

Tyr Ala Leu Asp Pro Lys Lys Leu Phe Gly Asn Asp Leu Lys Thr lie 
45 210 215 220 

Asn Leu Glu Asp Leu Arg Thr lie Leu His Glu Phe Ser His Thr Lys 
225 230 235 240 

Gly Tyr Gly His Asn Gly Asn Met Thr Tyr Gin Arg Val Pro Val Thr 
245 250 255 

50 Lys Asp Gly Gin Val Glu Lys Asp Ser Asn Gly Lys Pro Lys Asp Ser 
260 265 270 

Asp Gly Leu Pro Tyr Asn Val Cys Ser Leu Tyr Gly Gly Ser Asn Gin 

275 280 285 

Pro Ala Phe Pro Ser Asn Tyr Pro Asn Ser lie Tyr His Asn Cys Ala 
55 290 295 300 
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ASP Val Pro Ala Gly Phe Leu Gly Val Thr Ala Ala Val Trp Gin Gin 
305 310 315 320 

Leu He Asn Gin Asn Ala Leu Pro He Asn Tyr Ala Asn Leu Gly Ser 
325 330 335 

5 Gin Thr Asn Tyr Asn Leu Asn Ala Ser Leu Asn Thr Gin Asp Leu Ala 
340 345 350 

Asn Ser Met Leu Ser Thr He Gin Lys Thr Phe Val Thr Ser Ser Val 

355 360 365 

Thr Asn His His Phe Ser Asn Ala Ser Gin Ser Phe Arg Ser Pro He 
10 370 375 380 

Leu Gly Val Asn Ala Lys He Gly Tyr Gin Asn Tyr Phe Asn Asp Phe 
385 390 395 400 

He Gly Leu Ala Tyr Tyr Gly He He Lys Tyr Asn Tyr Ala Lys Ala 
405 410 415 

15 Val Asn Gin Lys Val Gin Gin Leu Ser Tyr Gly Gly Gly He Asp Leu 
420 425 430 

Leu Leu Asp Phe He Thr Thr Tyr Ser Asn Lys Asn Ser Pro Thr Gly 

435 440 445 

He Gin Thr Lys Arg Asn Phe Ser Ser Ser Phe Gly He Phe Gly Gly 
20 450 455 460 

Leu Arg Gly Leu Tyr Asn Ser Tyr Tyr Val Leu Asn Lys Val Lys Gly 
465 470 475 480 

ser Gly Asn Leu Asp Val Ala Thr Gly Leu Asn Tyr Arg Tyr Lys His 
485 490 495 

25 Ser Lys Tyr Ser Val Gly He Ser He Pro Leu He Gin Arg Lys Ala 
500 505 510 

Ser Val Val Ser Ser Gly Gly Asp Tyr Thr Asn Ser Phe Val Phe Asn 

515 520 525 

Glu Gly Ala Ser His Phe Lys Val Phe Phe Asn Tyr Gly Gly Cys Phe 
30 530 535 540 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 3 56 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



40 



45 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...356 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Leu Met Lys Ser He Leu Leu Phe Met He Phe Val Val Cys Gin Leu 
1 5 10 15 

Glu Gly Lys Lys Phe Ser Gin Asp Asn Phe Lys Val Asp Tyr Asn Tyr 
55 20 25 30 
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Tyr Leu Arg Lys Gin Asp Leu His lie lie Lys Thr Gin Asn Asp Leu 

35 40 45 

Ser Asn Ala Trp Tyr Leu Pro Pro Gin Lys Ala Pro Lys Glu His Ser 
50 55 60 

5 Trp Val Asp Phe Ala Lys Lys Tyr Leu Asn Met Met Asp Tyr Leu Gly 
65 70 75 80 

Thr Tyr Phe Leu Pro Phe Tyr His Ser Phe Thr Pro lie Phe Gin Trp 

85 90 95 

Tyr His Pro Asn lie Asn Pro Tyr Gin Arg Asn Glu Phe Lys Phe Gin 
10 100 105 110 

He Ser Phe Arg Val Pro Val Phe Arg His He Leu Trp Thr Lys Gly 

115 120 125 

Thr Leu Tyr Leu Ala Tyr Thr Gin Thr Asn Trp Phe Gin He Tyr Asn 
130 135 140 

15 Asp Pro Gin Ser Ala Pro Met_Ar.g _Met .lie. .Asn P.he -Met -Pro -Glu Leu 
145 150 155 160 

He Tyr Val Tyr Pro He Asn Phe Lys Pro Phe Gly Gly Lys He Gly 

165 170 175 

Asn Phe Ser Glu He Trp He Gly Trp Gin His He Ser Asn Gly Val 
20 180 185 190 

Gly Gly Ala Gin Cys Tyr Gin Pro Phe Asn Lys Glu Gly Asn Pro Glu 

195 200 205 

Asn Gin Phe Pro Gly Gin Pro Val He Val Lys Asp Tyr Asn Gly Gin 
210 215 220 

25 Lys Asp Val Arg Trp Gly Gly Cys Xaa Ser Val Xaa Xaa Gly Asn Xaa 
225 230 235 240 

Leu Cys Phe Val Leu Val Trp Glu Lys Gly Gly Leu Lys He Met Val 

245 250 255 

Ala Tyr Trp Pro Tyr Val Pro Tyr Asp Gin Ser Asn Pro Gin Leu lie 
30 260 265 270 

Asp Tyr Met Gly Tyr Gly Asn Ala Lys He Asp Tyr Arg Arg Gly Arg 

275 280 285 

His His Phe Glu Leu Gin Leu Tyr Asp He Phe Thr Gin Tyr Trp Arg 
290 295 300 

35 Tyr Asp Arg Trp His Gly Ala Phe Arg Leu Gly Tyr Thr Tyr Arg He 
305 310 315 320 

Asn Pro Phe Val Gly He Tyr Ala Gin Trp Phe Asn Gly Tyr Gly Asp 

325 330 335 

Gly Leu Tyr Glu Tyr Asp Val Phe Ser Asn Arg He Gly Val Gly He 
40 340 345 350 

Arg Leu Asn Pro 
355 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



55 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
5 (B) LOCATION 1...675 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Leu Ser Lys Gly Leu Ser He Gly Asn Lys He He Leu Cys Val Ala 
10 1 5 10 15 

Leu He Val He Val Cys Val Ser He Leu Gly Val Ser Leu Asn Ser 

20 25 30 

Arg Val Lys Glu He Leu Lys Glu Ser Ala Leu His Ser Met Gin Asp 
35 40 45 

15 Ser Leu His Phe Lys Val Lys Glu Val Gin Ser Val Leu Glu Asn Thr 
50 55 60 

Tyr Thr Ser Met Gly He Val Lys Glu Met Leu Pro Glu Asp Thr Lys 
65 70 75 80 

Arg Glu He Lys He Gin Leu Leu Lys Asn Phe He Leu Ala Asn Ser 
20 85 90 95 

His Val Ala Gly Val Ser Met Phe Phe Lys Asp Arg Glu Asp Leu Arg 

100 105 110 

Leu Thr Leu Leu Arg Asp Asn Asp Thr He Lys Leu Met Glu Asn Pro 
115 120 125 

25 Ser Leu Gly Ser Asn Pro Leu Ala Gin Lys Ala Met Lys Asn Lys Glu 
130 135 140 

He Ser Lys Ser Leu Pro Tyr Tyr Arg Lys Met Pro Asn Gly Ala Glu 
145 150 155 160 

Val Tyr Gly Val Asp He Leu Leu Pro Leu Phe Lys Glu Asn Thr Gin 
30 165 170 175 

Glu Val Val Gly Val Leu Met He Phe Phe Ser He Asp Ser Phe Ser 

180 185 190 

Asn Glu He Thr Lys Asn Arg Ser Asp Leu Phe Leu He Gly Val Lys 
195 200 205 

35 Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gin Asp Lys Ser He 
210 215 220 

Thr Glu He Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu Val Met Ala 
225 230 235 240 

He Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu Asp Pro Phe 
40 245 250 255 

Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys Met Leu Gly 

260 265 270 

Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met He Ala Leu He He 
275 280 285 

45 Glu Lys Asp Lys Val Tyr g;Lu Gin Val Gly Ser Val Arg Phe Val Val 
290 295 300 

Val Ala Ala Ser Ala He Met Val Leu Ala Leu He He Ala He Thr 
305 310 315 320 

Leu Leu Met Arg Ala He Val Ser Asn Arg Leu Glu Val Val Ser Ser 
50 325 330 335 

Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gin Ala His Ser Ser 

340 345 350 

Asp He Lys Leu Val Glu Ala Arg Ser Asn Asp Glu Leu Gly Arg Met 
355 360 365 

55 Gin Thr Ala He Asn Lys Asn He Leu Gin Thr Gin Lys Thr Met Gin 
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370 375 380 

Glu Asp Arg Gin Ala Val Gin Asp Thr lie Lys Val Val Ser Asp Val 
385 390 395 400 

Lys Ala Gly Asn Phe Ala Val Arg lie Thr Ala Glu Pro Ala Ser Pro 
5 405 410 415 

Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly lie Met Asp Tyr Leu 

420 425 430 

Gin Glu Ser Val Gly Thr His Met Pro Ser lie Phe Lys lie Phe Glu 
435 440 445 

10 Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg lie Gin Asn Ala Ser Gly 
450 455 460 

Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gin Glu lie Gin Lys Met 
465 470 475 480 

Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn Asp Ser Ala 
15 485 490 495 

Asn -Leu Ly s Giu"Cys "V a 1 ^Gln"~A'srr"Leu Glu Lys "7[la Ser Asn Ser Gin 

500 505 510 

His Lys Ser Leu Met Glu Thr Ser Lys Thr lie Glu Asn lie Thr Thr 
515 520 525 

20 Ser lie Gin Gly Val Ser Ser Gin Ser Glu Ala Met lie Glu Gin Gly 
530 535 540 

Lys Asp lie Lys Ser lie Val Glu lie lie Arg Asp lie Ala Asp Gin 
545 550 555 560 

Thr Asn Leu Leu Ala Leu Asn Ala Ala lie Glu Ala Ala Arg Ala Gly 
25 565 570 575 

Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val Arg Lys Leu 

580 585 590 

Ala Glu Arg Thr Gin Lys Ser Leu Ser Glu lie Glu Ala Asn lie Asn 
595 600 605 

30 He Leu Val Gin Ser He Ser Asp Thr Ser Glu Ser He Lys Asn Gin 
610 615 620 

Val Lys Glu Val Glu Glu lie Asn Ala Ser He Glu Ala Leu Arg Ser 
625 630 635 640 

Val Thr Glu Gly Asn Leu Lys He Ala Ser Asp Ser Leu Glu He Ser 
35 645 650 655 

Gin Glu He Asp Lys Val Ser Asn Asp He Leu Glu Asp Val Asn Lys 
660 665 670 

Lys Gin Phe 
675 



40 



(2) INFORMATION FOR SEQ ID NO : 92 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



55 



(ix) FEATURE: 
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10 



15 



20 



25 



30 



35 



40 



(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...271 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 



Met 


Asn 


He 


Phe 


Lys 


Arg 


He 


He 


Cys 


Val 


Thr 


Ala 


He 


Val 


Leu 


Gly 


1 








5 










10 










15 




Phe 


Phe 


Asn 


Leu 


Leu 


Asp 


Ala 


Lys 


His 


His 


Lys 


Glu 


Lys 


Lys 


UlU 


Asp 






20 










25 










30 




xiXS 


His 


Lys 


He 


Thr 


Arg 


Glu 


Leu 


Lys 


Val 


Gly Ala 


Asn 


Pro 


Val 


Pro 




35 










40 










45 








Ala 


Gin 


He 


Leu 


Gin 


Ser 


Val 


Val 


Asp 


Asp 


Leu 


Lys 


Glu 


Lys 


Gly 


lie 


50 










55 










60 










Lys 


Leu 


Val 


He 


Val 


Ser 


Phe 


Thr 


Asp 


Tyr 


Val 


Leu 


Pro 'Asn 


Leu 


Ala 


65 










70 










75 










80 


Leu 


Asn 


Asp 


Gly 


Ser 


Leu 


Asp 


Ala 


Asn 


Tyr 


Phe 


Gin 


His 


Arg 


Pro 


Xyr 






85 










90 










95 




Leu 


Asp 


Arg 


Phe 


Asn 


Leu 


Asp 


Arg 


Lys 


Met 


His 


Leu 


Val 


Gly 


Leu 


Ala 






100 










105 










110 






Asn 


He 


His 
115 


Val 


Glu 


Pro 


Leu 


Arg 
12 0 


Phe 


Tyr 


Ser 


Gin 


Lys 
125 


He 


Thr 


Asp 


He 


Lys 


Asn 


Leu 


Lys 


Lys 


Gly 


Ser 


Val 


He 


Ala 


Val 


Pro 


Asn 


Asp 


Pro 




130 










135 










140 








He 


Ala 


Asn 


Gin 


Gly 


Arg 


Ala 


Leu 


He 


Leu 


Leu 


His 


Lys 


Gin 


Gly 


Leu 


145 










150 










155 








He 


160 


Ala 


Leu 


Lys 


Asp 


Pro 


Ser 


Asn 


Leu 


Tyr 


Ala 


Thr 


Glu 


Phe 


Asp 


Val 








165 










170 










175 




Lys 


Asn 


Pro 


Tyr 


Asn 


He 


Lys 


He 


Lys 


Pro 


Leu 


Glu 


Ala 


Ala 


Leu 


Leu 






180 










185 










190 






Pro 


Lys 


Val 


Leu 


Gly 


Asp 


Val 


Asp 


Gly 


Ala 


He 


He 


Thr 


Gly 


Asn 


Tyr 




195 










200 










205 








Ala 


Leu 


Gin 


Ala 


Lys 


Leu 


Thr 


Gly 


Ala 


Leu 


Phe 


Ser 


Glu 


Asp 


Lys 


Asp 




210 










215 










220 






Gin 




Ser 


Pro 


Tyr 


Ala 


Asn 


Leu 


Val 


Ala 


Ser 


Arg 


Glu 


Asp 


Asn 


Ala 


Asp 


225 








230 










235 










240 


Glu 


Ala 


He 


Lys 


Ala 


Leu 


He 


Glu 


Ala 


Leu 


Gin 


Ser 


Glu 


Lys 


Thr 


Arg 








245 










250 










255 




Lys 


Phe 


He 


Leu 


Asp 


Thr 


Tyr 


Lys 


Gly Ala 


He 


He 


Pro 


Ala 


Phe 








260 










265 










270 







(2) INFORMATION FOR SEQ ID NO: 93 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY : linear 

(ii) MOLECXJLE TYPE: protein 

50 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

55 (ix) FEATURE: 
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(A) NAME/KEY: misc__f eature 

(B) LOCATION 1.,.161 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

5 

Met Phe Phe Lys Thr Tyr Gin Lys Leu Leu Gly Ala Ser Cys Leu Ala 
15 10 15 

Leu Tyr Leu Val Gly Cys Gly Asn Gly Gly Gly Gly Glu Ser Pro Val 
20 25 30 

10 Glu Met lie Ala Asn Ser Glu Gly Thr Phe Gin lie Asp Ser Lys Ala 
35 40 45 

Asp Ser lie Thr lie Gin Gly Val Lys Leu Asn Arg Gly Asn Cys Ala 

50 55 60 

Val Asn Phe Val Pro Val Ser Glu Thr Phe Gin Met Gly Val Leu Ser 

15 65 70 75 80 

"Gin Val" Ttir~Pro He S^er ^le Gin Asp Phe Lys Asp Met Ala Ser Thr 

85 90 95 

Tyr Lys lie Phe Asp Gin Lys Lys Gly Leu Ala Asn lie Ala Asn Lys 
100 105 110 

20 lie Ser Gin Leu Glu Gin Lys Gly Val Met Met Glu Pro Gin Thr Leu 
115 120 125 

Asn Phe Gly Glu Ser Leu Lys Gly lie Ser Gin Gly Cys Asn lie lie 

130 135 140 

Glu Ala Glu lie Gin Thr Asp Lys Gly Ala Trp Thr Phe Asn Phe Asp 
25 145 150 155 160 

Lys 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc^feature 
45 (B) LOCATION 1.-..337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Met lie Arg Leu Lys Gly Leu Asn Lys Thr Leu Lys Thr Ser Leu Leu 

50 1 5 10 15 

Ala Gly Val Leu Leu Gly Ala Thr Ala Pro Leu Met Ala Lys Pro Leu 

20 25 30 

Leu Ser Asp Glu Asp Leu Leu Lys Arg Val Lys Leu His Asn lie Lys 
35 40 45 

55 Glu Asp Thr Leu Thr Ser Cys Asn Ala Lys -Val Asp Gly Ser Gin Tyr 
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10 



15 



20 



25 



30 



35 



40 



45 





50 










55 










60 










Leu 


Asn 


Ser 


Gly 


Trp 


Asn 


Leu 


Ser 


Lys 


Glu 


Phe 


Pro 


Gin 


Glu 


Tyr 


Arg 


65 










70 










75 










80 


Glu 


Lys 


He 


Phe 


Glu 


Cys 


Val 


Glu 


Glu 


Glu 


Lys 


His 


Lys 


Gin 


Ala 


Leu 








85 










90 










95 




Asn 


Leu 


He 


Asn 


Lys 


Glu 


Asp 


Thr 


Glu 


Asp 


Lys 


Glu 


Glu 


Leu 


Ala 


Lys 








100 










105 










110 






Lys 


He 


Lys 


Glu 


He 


Lys 


Glu 


Lys 


Ala 


Lys 


Val 


Leu 


Arg 


Gin 


Lys 


Phe 




115 










120 










125 








Met 


Ala 


Phe 


Glu 


Met 


Lys 


Glu 


His 


Ser 


Lys 


Glu 


Phe 


Pro 


Asn 


Lys 


Lys 




130 










135 










140 










Gin 


Leu 


Gin 


Thr 


Met 


Leu 


Glu 


Asn 


Ala 


Phe 


Asp 


Asn 


Gly 


Ala 


Glu 


Ser 


145 










150 










155 










160 


Phe 


He 


Asp 


Asp 


Trp 


His 


Glu 


Arg 


Phe 


Gly Gly 


He 


Ser 


Arg 


Glu 


Asn 










165 










170 










175 




Thr 


Tyr 


Lys 


Ala 


Leu 


Gly 


He 


Lys 


Glu 


Tyr 


Ser 


Asp 


Glu 


Gly 


Lys 


He 








180 










185 










190 






Leu 


Ala 


Phe 


Gly 


Glu 


Arg 


Ser 


Tyr 


He 


Arg 


Gin 


Tyr 


Lys 


Lys 


Asp 


Phe 






195 










200 










205 








Glu 


Glu 


Ser 


Thr 


Tyr 


Asp 


Thr 


Arg 


Gin 


Thr 


Leu 


Ser 


Ala 


Met 


Ala 


Asn 




210 










215 










220 










Met 


Ser 


Gly 


Glu 


Asn 


Asp 


Tyr 


Lys 


He 


Thr 


Trp 


Leu 


Lys 


Pro 


Lys 


Tyr 


225 










230 










235 










240 


Gin 


Leu 


His 


Ser 


Ser 


Asn 


Asn 


He 


Lys 


Pro 


Leu 


Met 


Ser 


Asn 


Thr 


Glu 










245 










250 










255 




Leu 


Leu 


Asn 


Met 


He 


Glu 


Leu 


Thr 


Asn 


He 


Lys 


Lys 


Glu 


Tyr 


Val 


Met 








260 










265 










270 






Gly 


Cys 


Asn 


Met 


Glu 


He 


Asp 


Gly 


Ser 


Lys 


Tyr 


Pro 


He 


His 


Lys 


Asp 






275 










280 










285 








Trp 


Gly 


Phe 


Phe 


Gly 


Lys 


Ala 


Lys 


Val 


Pro 


Glu 


Thr 


Trp 


Arg 


Asn 


Lys 




290 










295 










300 










He 


Trp 


Glu 


Cys 


He 


Lys 


Asn 


Lys 


Val 


Lys 


Ser 


Tyr 


Asp 


Asn 


Thr 


Thr 


305 








310 










315 










320 


Ala 


Glu 


He 


Gly 


He 


Val 


Trp 


Lys 


Lys 


Asn 


Thr 


Tyr 


Ser 


He 


Ser 


His 










325 










330 










335 




His 

































(2) INFORMATION FOR SEQ ID NO: 95: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
55 (B) LOCATION 1...416 



3NSDOCID: <WO 9818323A1J_> 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Met Lys Lys Leu Val Phe Ser Met Leu Leu Cys Cys Lys Ser Val Phe 
5 1 5 10 15 

Ala Glu Gly Glu Thr Pro Leu lie Val Asn Asp Pro Glu Thr His Val 

20 25 30 

Ser Gin Ala Thr lie lie Gly Lys Met Val Asp Ser lie Lys Arg Tyr 
35 40 45 

10 Glu Glu lie lie Ser Lys Ala Gin Ala Gin Val Asn Gin Leu Gin Lys 
50 55 60 

Val Asn Asn Met lie Asn Thr Thr Asn Ser Leu lie Ser Ser Ser Ala 
65 70 75 80 

lie Thr Leu Ala Asn Pro Met Gin Val Leu Gin Asn Ala Gin Tyr Gin 
15 85 90 95 

_T 1 e _G 1 u-S e r— 1 1 e- A-rg -T^t*- -As n— Ty-r^Gl-u— Asn— Leu- Ly s- -Gin- -S e r ^Irle-Giu 

100 105 110 

Asn Trp Asn Ala Gin Asn Leu Leu Arg Asn Lys Tyr Leu Gin Gin Gin 
115 120 125 

20 Cys Pro Trp Leu Asn Val Asn Ala Leu Thr Asn Asn Lys lie Val Asn 
130 135 140 

Leu Lys Asp Leu Asn Asn Leu lie Thr Lys Asn Gly Glu Gin Thr Gin 
145 150 155 160 

Thr Ala Arg Asp Val Gin Asn Leu lie Gin Ser lie Ser Gly Ser Gly 
25 165 170 175 

Tyr Gly Asn Met Gin Ser Leu Ala Gly Glu Leu Ser Gly Arg Ala Trp 

180 185 190 

Gly Glu Met Leu Cys Lys Met Val Asn Asp Ser Asn Tyr Glu Ser Glu 
195 200 205 

30 Gin Ala Leu Leu Ala Thr Gly Asn Asn Pro Glu Glu Gin Lys Arg Arg 
210 215 220 

Phe Leu Leu Arg Val Lys Lys Lys Val Asn Asp Asn Lys Gin Leu Lys 
225 230 235 240 

Asp Lys Leu Asp Pro Phe Leu Lys Arg Leu Asp Val Leu Gin Thr Glu 
35 245 250 255 

Phe Gly Val Thr Asp Pro Thr Ala Asn His Asn Lys Gin Gly lie His 

260 265 270 

Tyr Cys Thr Glu Asn Lys Glu Thr Gly Lys Cys Asp Pro lie Lys Asn 
275 280 285 

40 Val Phe Arg Thr Thr Arg Leu Asp Asn Glu Leu Glu Gin Glu lie Gin 
290 295 300 

Thr Leu Thr Leu Asp Leu lie Lys Ala Ser Asn Lys Asp Ala Gin Ser 
305 310 315 320 

Gin Ala Tyr Ala Asn Phe Asn Gin Arg lie Lys Leu Leu Thr Leu Lys 
45 325 330 335 

Tyr Leu Lys Glu lie Thr Asn Gin Met Leu Phe Leu Asn Gin Thr Met 

340 345 350 

Ala Met Gin Ser Glu lie Met Thr Asp Asp Tyr Phe Arg Gin Asn Asn 
355 360 365 

50 Asp Gly Phe Gly Glu Lys Glu Asn His lie Asp Lys Gin Leu Thr Gin 
370 375 380 

Lys Arg lie Asn Glu Arg Glu Arg Ala Arg lie Tyr Phe Gin Asn Pro 
385 390 395 400 

Asn Val Lys Phe Asp Gin Phe Gly Phe Pro lie Phe Ser lie Trp Asp 
55 405 410 415 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 76 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1 . . . 376 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Val Asn Lys Trp He Lys Gly Ala Val Val Phe Val Gly Gly Phe Ala 
1 5 10 15 

Thr He Thr Thr Phe Ser Leu He Tyr His Gin Lys Pro Lys Ala Pro 
25 20 25 30 

Leu Asn Asn Gin Pro Ser Leu Leu Asn Asp Asp Glu Val Lys Tyr Pro 

35 40 45 

Leu Gin Asp Tyr Thr Phe Thr Gin Asn Pro Gin Pro Thr Asn Thr Glu 
50 55 60 

30 Ser Ser Lys Asp Ala Thr He Lys Ala Leu Gin Glu Gin Leu Lys Ala 
65 70 75 80 

Ala Leu Lys Ala Leu Asn Ser Lys Glu Met Asn Tyr Ser Lys Glu Glu 

85 90 95 

Thr Phe Thr Ser Pro Pro Met Asp Pro Lys Thr Thr Pro Pro Lys Lys 
35 100 105 110 

Asp Phe Ser Pro Lys Gin Leu Asp Leu Leu Ala Ser Arg He Thr Pro 

115 120 125 

Phe Lys Gin Ser Pro Lys Asn Tyr Glu Glu Asn Leu He Phe Pro Val 
130 135 140 

40 Asp Asn Pro Asn Gly He Asp Ser Phe Thr Asn Leu Lys Glu Lys Asp 
145 150 155 160 

He Ala Thr Asn Glu Asn Lys Leu Leu Arg Thr He Thr Ala Asp Lys 

165 170 175 

Met He Pro Ala Phe Leu He Thr Pro He Ser Ser Gin He Ala Gly 
45 180 185 190 

Lys val He Ala Gin Val Glu Ser Asp He Phe Ala Ser Met Gly Lys 

195 200 205 

Ala Val Leu He Pro Lys Gly Ser Lys Val He Gly Tyr Tyr Ser Asn 
210 215 220 

50 Asn Asn Lys Met Gly Glu Tyr Arg Leu Asp He Val Trp Ser Arg He 
225 230 235 240 

He Thr Pro His Gly He Asn He Met Leu Thr Asn Ala Lys Gly Ala 

245 250 255 

Asp He Lys Gly Tyr Asn Gly Leu Val Gly Glu Leu He Glu Arg Asn 
55 260 265 270 



9818323A1_I_> 
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10 



15 



35 



40 



45 



50 



55 



n V. M 

Pne 


Gin 


Arg 
275 


Tyr 


tjxy 


vax 


Pro 


Leu 
280 


Leu 


Leu 


Ser 


Thr 


Leu 
285 


Thr 


Asn 


ux y 


Leu 


Leu 


xxe 


(jxy 


xxe 


rpVi -y- 

inr 


Ser 


>\i.a 


Leu 


Asn 


Asn 


Arg Gly 


Asn 


Lys 


uXU 




290 










295 










300 










Glu 


Val 


Thr 


Asn 


1-1 V-. -s. 

Pne 


Pne 


Gly 


Asp 


Tyr 


Leu 


Leu 


Leu 


Gin 


Leu 


Met 


Arg 


305 










310 










315 










320 


Gin 


Ser 


Gly 


Met 


Gly 


He 


Asn 


Gin 


Val 


Val 


Asn 


Gin 


He 


Leu Arg 


Asp 










325 










330 










335 




Lys 


Ser 


Lys 


He 
340 


Ala 


Pro 


He 


Val 


Val 
345 


He 


Arg 


Glu 


Gly 


Ser 
350 


Arg 


Val 


Phe 


He 


Ser 
355 


Pro 


Asn 


Thr 


Asp 


He 
360 


Phe 


Phe 


Pro 


He 


Pro 
365 


Arg 


Glu 


Asn 


Glu 


Val 
370 


He 


Ala 


Glu 


Phe 


Leu 
375 


Lys 



















(2) INFORMATION"FOR "SEQ ID" NO: 97 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 916 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

30 (ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1. . .916 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



Val 


Asp 


Leu 


Arg 


He 


Gin 


Ser 


Lys 


Glu 


Val 


Ser 


His 


Asn 


Leu 


Lys 


Glu 


1 








5 










10 










15 




Leu 


Ser 


Lys 


Thr 


Leu 


He 


Ser 


Tyr 


Pro 


Phe 


Glu 


Lys 


His 


Val 


Glu 


Ala 








20 










25 










30 






Leu 


Gly Glu 


Gin 


Cys 


Ser 


Asn 


Phe 


Val 


Ser 


He 


Pro 


He 


Asn 


Asn 


Asp 






35 










40 










45 








Asp 


Tyr 


Ser 


Asn 


He 


Cys 


Thr 


Phe 


Val 


Ser 


Asp 


Phe 


He 


Asn 


Leu 


He 




50 










55 










60 










Ala 


Ser 


Tyr 


Asn 


Leu 


Leu 


Glu 


Ser 


Phe 


Leu 


Asp 


Phe 


Tyr 


Lys 


Asp 


Lys 












70 










75 










80 


Leu 


Lys 


Leu 


Ser 


Glu 


Leu 


Val 


Thr 


Glu 


Tyr 


Ala 


Asn 


Val 


Thr 


Asn 


Asn 










85 










90 










95 




Leu 


Leu 


Phe 


Lys 


Lys 


Leu 


He 


Lys 


His 


Leu 


Ser Gly Asn 


Asn 


Gin 


Leu 








100 










105 










110 






Val 


Lys 


Asn 


Phe 


Tyr 


Gin 


Cys 


He 


Arg 


Glu 


He 


He 


Lys 


Tyr 


Asn 


Ala 






115 










120 










125 








Pro 


Asn 


Lys 


Glu 


Tyr 


Lys 


Pro 


Asn 


Gin 


Phe 


Phe 


He 


He 


Gly Lys 


Gly 




130 










135 










140 










Lys 


Gin 


Lys 


Gin 


Leu 


Ala 


Lys 


He 


Tyr 


Ser 


His 


Leu 


Lys 


Glu 


Leu 


Ser 


145 










150 










155 










160 
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Ala ser Glu He Lys Pro Gin Asp Met Glu Asp He Leu Lys Lys Leu 

165 170 175 

Glu Glu Leu Asp Lys He Phe Lys Thr Thr Asp Phe Thr Lys Phe Thr 
180 185 190 

5 Pro Lys Thr Glu He Lys Asp He He Lys Glu He Asp Glu Lys Tyr 
195 200 205 

Pro He Asn Glu Asn Phe Lys Arg Gin Phe Asn Glu Phe Glu Ser Asn 

210 215 220 

He Glu Lys His Asp Glu He Lys Lys Asp Phe Glu Arg Asn Lys Glu 
10 225 230 235 240 

Ser Leu He Arg Glu He Glu Asn His Cys Lys Asn Glu Cys Asn Ser 

245 250 255 

Glu Glu Glu Pro Glu Tyr Lys He Asn Asp Leu Leu Lys Asn He Gin 
260 265 270 

15 Gin He Cys Lys Asn Tyr He Glu Ser His Ala Val Asn Asp Val Ser 
275 280 285 

Lys Asp He Lys Ser Met Met Cys Gin Phe Tyr Leu Lys Gin He Asp 

290 295 300 

Leu Leu Val Asn Ser Glu He Val Arg Tyr Arg Tyr Ser Asn Leu Phe 
20 305 310 315 320 

Glu Pro He Gin Arg Ser Leu Trp Glu Ser He Lys He Leu Asp Asn 

325 330 335 

Glu Ser Gly He Tyr Leu Phe Pro Lys Asn He Gly Glu He Lys Asp 
340 345 350 

25 Lys Phe Glu Ala Asn Lys Glu Lys Phe Lys Gin Ser Lys Asn Val Ser 
355 360 365 

Glu Phe Ala Glu Tyr Cys Arg Glu Cys Asn Pro Tyr Thr Ala Phe Asn 

370 375 380 

Phe His Leu Asn He Asn Asn Gly Leu Ser His Gin Phe Glu Lys Phe 
30 385 390 395 400 

Val Pro He Met Lys Glu Tyr Lys Glu Pro Lys He Thr Asp Asn Asp 

405 410 415 

Leu Glu Ala He Ser Thr Lys Glu Thr Gly Leu Ala Ser Gin Leu Ser 
420 425 430 

35 Gly His Trp Phe Phe Gin Leu Ser Leu Phe Asn Lys Thr Asn Phe Asn 
435 440 445 

Pro Asn Lys He Trp He Pro Leu Glu Phe Asn Lys Arg Ser Lys He 

450 455 460 

Lys Phe Asp Lys Asp Leu Glu He Tyr Phe Asp Ser His Glu Ser Phe 
40 465 470 475 480 

Asn He Ser Lys Lys Tyr Leu Gin Glu He Asp Gin Glu Ser Leu Lys 

485 490 495 

Lys He Lys Gin Ser Lys Asp Phe Phe Ser He Gin Lys He Glu Ser 
500 505 510 

45 Lys His Asp Asn Asn Asp lie Leu Gin Leu Glu Phe Phe Glu Asn Asp 
515 520 525 

Thr Ser Phe Leu Phe Ala Lys Gly Ser Phe Ala Glu He Leu Glu Tyr 

530 535 540 

Asn Met Gin Leu Lys He Asp Ser Leu He Thr Lys Glu Phe Asn Lys 
50 545 550 555 560 

Leu Leu Ala He Val Gin Asp Ser Pro Gin Asp Ser Tyr Gin Leu Lys 

565 570 575 

He Arg Val Arg His Asn Asn Lys Leu Pro Arg Glu Lys Tyr Thr Glu 
580 585 590 

55 His Glu He Lys Leu Glu Val Tyr Asp Cys Arg Lys Ser His Asp His 
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595 600 605 

Asn Glu Pro lie lie Leu Ser Gin Gin Ser Thr Gly Phe Gin Trp Ala 

610 615 620 

Phe Asn Phe Met Phe Gly Phe Leu Tyr Asn Val Gly Ser His Phe Ser 
5 625 630 635 640 

Phe Asn His Asn lie lie Tyr Val Met Asp Glu Pro Ala Thr His Leu 

645 650 655 

Ser Val Pro Ala Arg Lys Glu Phe Arg Lys Phe Leu Lys Glu Tyr Ala 
660 665 670 

10 His Lys Asn His Val Thr Phe Val Leu Ala Thr His Asp Pro Phe Leu 
675 680 685 

Val Asp Thr Asp His Leu Asp Glu lie Arg lie Val Glu Lys Glu Thr 

690 695 700 

Glu Gly Ser Val lie Lys Asn His Phe Asn Tyr Pro Leu Asn Asn Ala 
15 705 710 715 720 

Ser Lys Asp Ser -Asp A.la Leu Asp Lys -I-l-e-Lys— Arg-Ser Leu Giy-Vai 

725 730 735 

Gly Gin His Val Phe His Asn Pro Gin Lys His Arg lie lie Phe Val 
740 745 750 

20 Glu Gly lie Thr Asp Tyr Cys Tyr Leu Ser Ala Phe Lys Leu Tyr Leu 
755 760 765 

Arg Tyr Lys Glu Tyr Lys Asp Asn Pro lie Pro Phe Thr Phe Leu Pro 

770 775 780 

lie Ser Gly Leu Lys Asn Asp Ser Asn Asp Met Lys Glu Thr lie Glu 
25 785 790 795 800 

Lys Leu Cys Glu Leu Asp Asn His Pro lie Val Leu Thr Asp Asp Asp 

805 810 815 

Arg Lys Cys Val Phe Asn Gin Gin Ala Thr Ser Glu Arg Phe Lys Arg 
820 825 830 

30 Ala Asn Glu Glu Met His Asp Pro He Thr He Leu Gin Leu Ser Asp 
835 840 845 

Cys Asp Arg His Phe Lys Gin He Glu Asp Cys Phe Ser Ala Asn Asp 

850 855 860 

Arg Asn Lys Tyr Ala Lys Asn Lys Gin Met Glu Leu Ser Met Ala Phe 
35 865 870 875 880 

Lys Thr Arg Leu Leu Tyr Gly Gly Glu Asp Ala He Glu Lys Gin Thr 

885 890 895 

Lys Arg Asn Phe Leu Lys Leu Phe Lys Trp He Ala Trp Ala Thr Asn 
900 905 910 

40 Leu He Lys Asn 
915 

(2) INFORMATION FOR SEQ ID NO: 98: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (ii) MOLECtlLE TYPE: protein 

(iii) HYPOTHETICAL: YES 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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10 



15 



20 



25 



30 



50 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . . 176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



Met 


Thr 


Ala 


Met: 


Met 


Arg 


Tyr 


Phe 


His 


He 


Tyr 


Ala 


Thr 


Thr 


fne 


xrne 


1 








5 










10 










15 




Phe 


Pro 


Leu 


Ala 


Leu 


Leu 


Phe 


Ala 


Val 


Ser 


Gly 


Leu 


Ser 


Leu 


Leu 


Phe 






20 










25 










30 






Lys 


Ala 


Arg 


Gin 


Asp 


Thr 


Gly 


Ala 


Lys 


He 


Lys 


Glu 


Trp 


Val 


Leu 


Glu 




35 










40 










45 






He 


Iiys 


Ser 


Leu 


Lys 


Lys 


Glu 


Glu 


Arg 


Leu 


Asp 


Phe 


Leu 


Lys 


Gly 


Phe 


50 










55 










60 






Glu 




lays 


Glu 


Asn 


His 


lie 


Ala 


Met 


Pro 


Lys 


Lys 


He 


Glu 


Pro 


Arg 


Tyr 


65 










70 










75 










80 


Arg 


Gly 


Ala 


Leu 


Val 


He 


Gly 


Thr 


Pro 


Leu 


Tyr 


Glu 


He 


Asn 


Leu 


Glu 






85 










90 










95 




Thr 


Lys 


Gly 


Thr 


Gin 


Thr 


Lys 


He 


Lys 


Thr 


He 


Glu 


Arg 


Gly 


Phe 


Leu 






100 










105 










110 






Gly 


Ala 


Leu 


lie 


Met 


Leu 


His 


Lys 


Ala 


Lys 


Val 


Gly 


He 


Val 


Phe 


Gin 




115 










120 










125 








Ala 


Leu 


Leu 


Gly 


He 


Phe 


Cys 


Val 


Phe 


Leu 


Leu 


Leu 


Phe 


Tyr 


Leu 


Ser 




130 










135 










140 










Ala 


Phe 


Leu 


Met 


Val 


Ala 


Phe 


Lys 


Asp 


Thr 


Lys 


Arg 


Met 


Phe 


He 


Ser 


145 










150 










155 










160 


Val 


Leu 


lie 


Gly 


Ser 


Val 


Val 


Phe 


Phe 


Gly Ala 


He 


Tyr 


Trp 


Ser 


Leu 








165 










170 










175 





(2) INFORMATION FOR SEQ ID NO: 99: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 
35 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

40 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

45 (ix) FEATURE: 

(A) NAME/KEY:' misc_feature 

(B) LOCATION 1. . .222 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



Met Phe Lys Asn Ala Leu Asn He Gin Asp Phe Ser Phe Lys Asn His 
1 5 10 i5 

Thr Ser Thr Ala He He Gly Thr Asn Gly Ala Gly Lys Ser Thr Leu 
20 25 30 

55 He Asn Thr He Leu Gly He Arg Ser Asp Tyr Asn Phe Lys Ala Gin 
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35 40 45 

Asn Asn Asn lie Pro Tyr His Asp Asn Val lie Pro Gin Arg Lys Gin 

50 55 60 

Leu Gly Val Val Ser Asn Leu Phe Asn Tyr Pro Pro Gly Leu Asn Ala 
5 65 70 75 80 

Asn Asp Leu Phe Lys Phe Tyr Gin Phe Phe His Lys Asn Cys Thr Leu 

85 90 95 

Asp Leu Phe Glu Lys Asn Leu Leu Asn Lys Thr Tyr Glu His Leu Ser 
100 105 110 

0 Asp Gly Gin Lys Gin Arg Leu Lys lie Asp Leu Ala Leu Ser His His 
115 120 125 

Pro Gin Leu Val lie Met Asp Glu Pro Glu Thr Ser Leu Glu Gin Asn 

130 135 140 

Ala Leu He Arg Leu Ser Asn Leu He Ser Leu Arg Asn Thr Gin Gin 
5 145 150 155 160 

-Leu- -T-h2^S e l^— I-l-e— I-l-e— Ala- -Thr — Ki-s— Asp -P r o— lie- Va-i~Leu— Asp~S e r "Cy s 

165 170 175 

Glu Trp Val Leu Leu Leu Lys Asn Gly Asn He Ala Gin Tyr Lys Pro 
180 185 190 

0 Leu Asn Ser lie Leu Lys Ser Val Ala Lys Thr Phe Asn Phe Lys Glu 
195 200 205 

Lys Pro Thr Thr Lys Asp Leu Leu Ala Leu Leu Lys Asp He . 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 100: 



(i) SEQUENCE CHT^RACTERISTICS : 

(A) LENGTH: 406 amino acids 

(B) TYPE : amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1.;.406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



Met Tyr Ala Ala His Pro He Lys 
1 5 
Ser Gin Phe Leu Arg Arg Val Phe 
20 

Asn Asp Gin Ala Cys Pro Leu Glu 

35 40 
His Lys Ala Met He Ala Tyr Leu 

. 50 55 
Gly Lys Asp Leu Asp Leu Asp Leu 
65 70 
Phe Leu Glu Arg Leu Val Leu Thr 



Pro He Lys Ala Pro Lys Leu Lys 

10 IS 
Val Gly Ala Ser He Arg Arg Trp 
25 30 
Phe Val Glu Leu Asp Lys Gin Ala 
45 

Leu Ala Lys Asp Leu Lys Asp Arg 
60 

Leu He Lys Tyr Phe Cys Phe Glu 

75 80 
Asp He Lys Pro Pro He Phe Tyr 
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85 90 95 

Ala Leu Gin Gin Thr His Ser Lys Glu Leu Ala Ser Tyr Val Ala Gin 

100 105 110 

ser Leu Gin Asp Glu lie Ser Ala Tyr Phe Ser Leu Glu Glu Leu Lys 

115 120 125 

Glu Tyr Leu Ser His Arg Pro Gin lie Leu Glu Thr Gin lie Leu Glu 

130 135 140 

Ser Ala His Phe Tyr Ala Ser Lys Trp Glu Phe Asp He He Tyr Hxs 
45 150 155 160 

Phe Asn Pro Asn Met Tyr Gly Val Lys Glu He Lys Asp Lys He Asp 

165 170 175 

Lvs Gin Leu His Asn Asn Asp His Leu Phe Glu Gly Leu Phe Gly Glu 

180 185 190 

Lys Glu Asp Leu Lys Lys Leu Val Ser Met Phe Gly Gin Leu Arg Phe 
J5 195 200 205 

Gin Lys Arg Trp Ser Gin Thr Pro Arg Val Pro Gin Thr Ser Val Leu 

210 215 220 

Glv His Thr Leu Cys Val Ala He Met Gly Tyr Leu Leu Ser Phe Asp 
225 230 235 240 

70 Leu Lys Ala Cys Lys Ser Met Arg He Asn His Phe Leu Gly Gly Leu 

245 250 255 

Phe His Asp Leu Pro Glu He Leu Thr Arg Asp He He Thr Pro He 

260 265 270 

Lys Gin Ser Val Ala Gly Leu Asp His Cys He Lys Glu He Glu Lys 
25 275 280 285 

Lys Glu Met Gin Asn Lys Val Tyr Ser Phe Val Ser Leu Gly Val Gin 

290 295 300 

Glu Asp Leu Lys Tyr Phe Thr Glu Asn Glu Phe Lys Asn Arg Tyr Lys 
05 310 315 220 

30 ASP Lys ser His Gin He Val Phe Thr Lys Asp Ala Glu Glu Leu Phe 

325 330 335 

Thr Leu Tyr Asn Ser Asp Glu Tyr Leu Gly Val Cys Gly Glu Leu Leu 

340 345 350 

Lys val cys Asp His Leu Ser Ala Phe Leu Glu Ala Gin He Ser Leu 
35 355 360 365 

ser His Gly He Ser Ser Tyr Asp Leu He Gin Gly Ala Lys Asn Leu 

370 375 380 

Leu Glu Leu Arg Ser Gin Thr Glu Leu Leu Asp Leu Asp Leu Gly Lys 
385 390 395 400 

40 Leu Phe Arg Asp Phe Lys 

405 

(2) INFORMATION FOR SEQ ID NO: 101: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

50 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Helicobacter pylori 



3NSDOC1D- <WO 9818323A1_I_> 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .335 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Val Leu Trp Val Leu Tyr Phe Leu Thr Ser Leu Phe lie Cys Ser Leu 
1 5 10 15 

10 lie Val Leu Trp Ser Lys Lys Ser Met Leu Phe Val Asp Asn Ala Asn 
20 25 30 

Lys He Gin Gly Phe His His Ala Arg Thr Pro Arg Ala Gly Gly Leu 

35 40 45 

Gly He Phe Leu Ser Phe Ala Leu Ala Cys Tyr Leu Glu Pro Phe Glu 
15 50 55 _ _ 

— ^Met"~Pro— Phe~Lys— Gly~Pro~Phe~Val~ Phe" Leu^ Gly 'LeuT Ser Leu Val" Phe 
65 70 75 80 

Leu Ser Gly Phe Leu Glu Asp He Asn Leu Ser Leu Ser Pro Lys He 
85 90 95 

20 Arg Leu He Leu Gin Ala Val Gly Val Val Cys He He Ser Ser Thr 
100 105 110 

Pro Leu Val Val Ser Asp Phe Ser Pro Leu Phe Ser Leu Pro Tyr Phe 

115 120 125 

He Ala Phe Leu Phe Ala He Phe Met Leu Val Gly He Ser Asn Ala 
25 130 135 140 

He Asn He He Asp Gly Phe Asn Gly Leu Ala Ser Gly He Cys Ala 
145 150 155 160 

He Ala Leu Leu Val He His Tyr He Asp Pro Ser Ser Leu Ser Cys 
165 170 175 

30 Leu Leu Ala Tyr Met Val Leu Gly Phe Met Val Leu Asn Phe Pro Ser 
180 185 190 

Gly Lys He Phe Leu Gly Asp Gly Gly Ala Tyr Phe Leu Gly Leu Val 

195 200 205 

Cys Gly He Ser Leu Leu His Leu Ser Leu Glu Gin Lys He Ser Val 
35 210 215 220 

Phe Phe Gly Leu Asn Leu Met Leu Tyr Pro Val He Glu Val Leu Phe 
225 230 235 240 

Ser He Leu Arg Arg Lys He Lys Arg Gin Lys Ala Thr Met Pro Asp 
245 250 255 

40 Asn Leu His Leu His Thr Leu Leu Phe Lys Phe Leu Gin Gin Arg Ser 
260 265 270 

Phe Asn Tyr Pro Asn Pro Leu Cys Ala Phe He Leu He Leu Cys Asn 

275 280 285 

Leu Pro Phe He Leu He Ser Val Leu Phe Arg Leu Asp Ala Tyr Ala 
45 290 295 300 

Leu He Val He Ser Leu Val Phe He Ala Cys Tyr Leu He Gly Tyr 
305 310 315 320 

Ala Tyr Leu Asn Arg Gin Val Cys Ala Leu Glu Lys Arg Ala Phe 
325 330 335 



50 



(2) INFORMATION FOR SEQ ID NO: 102; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 
55 (B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

5 (iii) HYPOTHETICTVL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

10 (ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .96 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



Met 


Lys 


Lys 


Val 


He 


Val 


Ala 


Leu 


Gly 


Val 


Leu 


Ala 


Phe 


Ala 


Asn 


Val 


1 




5 










10 










15 




Leu 


Met 


Ala 


Thr 


Asp 


Val 


Lys 


Ala 


Leu 


Val 


Lys 


Gly 


Cys 


Ala 


Ala 


Cys 








20 










25 










30 






His 


Gly 


Val 


Lys 


Phe 


Glu 


Lys 


Lys 


Ala 


Leu 


Gly 


Lys 


Ser 


Lys 


He 


Val 




35 










40 










45 








Asn 


Met 


Met 


Ser 


Glu 


Lys 


Glu 


He 


Glu 


Glu 


Asp 


Leu 


Met 


Ala 


Phe 


Lys 




50 










55 










60 










Ser 


Gly 


Ala 


Asn 


Lys 


Asn 


Pro 


Val 


Met 


Thr 


Ala 


Gin 


Ala 


Lys 


Lys 


Leu 


65 








70 










75 










80 


Ser 


Asp 


Glu 


Asp 


He 


Lys 


Ala 


Leu 


Ala 


Lys 


Tyr 


He 


Pro 


Thr 


Leu 


Lys 








85 










90 










95 




(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 103 : 

















15 



20 



25 



30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY : linear 

( ii) MOLECULE TYPE : protein 

(iii) HYPOTHETICAL: YES 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
45 (B) LOCATION 1.. ,156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 

Met Arg Asp Phe Asn Asn He Gin He Thr Arg Leu Lys Val Arg Gin 
50 1 5 10 15 

Asn Ala Val Phe Glu Lys Leu Asp Leu Glu Phe Lys Asp Gly Leu Ser 

20 25 30 

Ala He Ser Gly Ala Ser Gly Val Gly Lys Ser Val Leu He Ala Ser 
35 40 45 

55 Leu Leu Gly Ala Phe Gly Leu Lys Glu Ser Asn Ala Ser Asn He Glu 
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50 

Val Glu Leu lie 
65 

Arg Glu Asp Glu 

5 

Lys Thr Arg Tyr 
100 

Lys Ala Leu Leu 
115 

10 Ser Gin Asn Glu 
130 

Tyr lie Gin Asn 
145 



55 

Ala Pro Phe Leu 
70 

His Glu Pro Leu 
85 

Phe Leu Asn Gin 

Lys Gl y Leu lie 
120 

Leu Asn Asp lie 
135 

Lys Asn Arg Arg 
150 
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60 

Asp Thr Glu Glu 
75 

Val lie Ser Val 
90 

Thr Ser Leu Ser 
105 

Lys Arg Leu Ser 

Leu Met Leu Ser 
14 0 

Leu Ala Pro Phe 
155 



Tyr Gly lie Phe 
80 

lie Lys Lys Glu 
95 

Lys Asn Thr Leu 
110 

Asn Asp Arg Phe 
125 

Leu Leu Asp Gly 



15 (2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE : amino acid 
20 (D) TOPOLOGY: linear 

(ii) MOLECtlLE TYPE: protein 

( i i i ) HYPOTHETI CAL : YES 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
30 (A) NAME/KEY: misc_feature 

(B) LOCATION l..,lia 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



35 Val Met Leu 
1 

Met Lys Lys 

Phe Cys Asn 
40 35 
Glu Val Gly 
50 

Met Glu Asn 
65 

45 Lys Glu Gin 
Gly Asn Asp 
Thr Tyr Phe 

50 115 



Met Ala lie Phe 
5 

Ser Met Ser Leu 
20 

Arg Asp lie Lys 

Leu Asp Ser Asn 
55 

His Leu lie Gly 
70 

Val Asp lie Arg 
85 

Thr Lys Lys Phe 
100 

His Arg Ser 



Thr Pro Tyr lie Leu 
10 

Phe Ala Asn Met Gly 
25 

Asp Leu Asn Asp Phe 
40 

Ala Arg Lys Asn Arg 
60 

Leu Phe Val Gin Ala 
75 

Glu Phe Glu Asp Leu 
90 

Asp Phe Val lie Phe 
105 



lie Leu Lys Met 
15 

Leu Glu Gin lie 
30 

Val Phe Gly He 
45 

Ser Arg Lys Ala 

Gin Leu Asn Phe 
80 

Arg Gin Ala Phe 
95 

Ser Lys Glu Lys 
110 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 35 5 amino acids 
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10 



20 



25 



30 



35 



40 



45 



50 



55 



-178- 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

( A) NAME /KEY: misc^fea tur e 

(B) LOCATION 1...355 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105; 



Met 


Asn 


He 


Lys 


He 


Leu 


Lys 


He 


Leu 


Val 


Gly 


Gly 


Leu 


Phe 


Phe 


Leu 


1 








5 










10 










15 




Ser 


Leu 


Asn 


Ala 
20 


His 


Leu 


Trp 


Gly 


Lys 
25 


Gin 


Asp 


Asn 


Ser 


Phe 
30 


Leu 


Gly 


He 


Gly 


Glu 


Arg 


Ala 


Tyr 


Lys 


Ser 


Gly Asn 


Tyr 


Ser 


Lys 


Ala 


Ala 


Ser 






35 










40 










4 5 








Tyr 


Phe 


Lys 


Lys 


Ala 


Cys 


Asn 


Asp 


Gly 


Val 


Ser 


Glu 


Gly 


Cys 


Thr 


Gin 


50 










55 










60 










Leu 


Gly 


He 


He 


Tyr 


Glu 


Asn 


Gly 


Gin 


Gly 


Thr 


Arg 


He 


Asp 


Tyr 


Lys 


65 








70 










75 










80 


Lys 


Ala 


Leu 


Glu 


Tyr 


Tyr 


Lys 


Thr 


Ala 


Cys 


Gin 


Ala 


Asp 


Asp 


Arg 


Glu 








85 










90 










95 




Gly 


Cys 


Phe 


Gly 


Leu 


Gly 


Gly 


Leu 


Tyr 


Asp 


Glu 


Gly 


Leu 


Gly 


Thr 


Ala 




100 










105 










X J. u 






Gin 


Asn 


Tyr 
115 


Gin 


Glu 


Ala 


He 


Asp 
120 


Ala 


Tyr 


Ala 


Lys 


Ala 
125 


Cys 


Val 


Leu 


Lys 


His 


Pro 


Glu 


Ser 


Cys 


Tyr 


Asn 


Leu 


Gly 


He 


He 


Tyr 


Asp 


Arg 


Lys 


130 










135 










14 0 










He 


Lys 


Gly 


Asn 


Ala 


Ala 


Gin 


Ala 


Val 


Thr 


Tyr 


Tyr 


Gin 


Lys 


Ser 


Cys 


145 






150 










155 










160 


Asn 


Phe 


Asp 


Met 


Ala 


Lys 


Gly 


Cys 


Tyr 


He 


Leu 


Gly 


Thr 


Ala 


Tyr 


Glu 








165 










170 










175 




Lys 


Gly 


Phe 


Leu 


Glu 


Val 


Lys 


Gin 


Ser 


Asn 


His 


Lys 


Ala 


Val 


He 


Tyr 




180 










185 










190 






Tyr 


Leu 


Lys 


Ala 


Cys 


Arg 


Leu 


Asn 


Glu 


Gly 


Gin 


Ala 


Cys 


Arg 


Ala 


Leu 




195 










200 










205 








Gly 


Ser 


Leu 


Phe 


Glu 


Asn 


Gly 


Asp 


Ala 


Gly 


Leu 


Asp 


Glu 


Asp 


Phe 


Glu 


210 










215 










220 








Gly 


Val 


Ala 


Phe 


Asp 


Tyr 


Leu 


Gin 


Lys 


Ala 


Cys 


Ala 


Leu 


Asn 


Asn 


Ser 


225 










230 










235 










240 


Gly 


Cys 


Ala 


Ser 


Leu 


Gly 


Ser 


Met 


Tyr 


Met 


Leu 


Gly 


Arg 


Tyr 


Val 


Lys 






245 










250 










255 




Lys 


Asp 


Pro 


Gin 


Lys 


Ala 


Phe 


Asn 


Tyr 


Phe 


Lys 


Gin 


Ala 


Cys 


Asp 


Met 




260 










265 










270 






Gly 


Ser 


Ala 


Val 


Ser 


Cys 


Ser 


Arg 


Met 


Gly 


Phe 


Met 


Tyr 


Ser 


Gin 


Gly 




275 










280 










285 








Asp 


Thr 


Val 


Ser 


Lys 


Asp 


Leu 


Arg 


Lys 


Ala 


Leu 


Asp 


Asn 


Tyr 


Glu 


Arg 


290 










295 










300 










Gly 


Cys 


Asp 


Met 


Gly 


Asp 


Glu 


Val 


Gly 


Cys 


Phe 


Ala 


Leu 


Ala 


Gly 


Met 
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305 310 315 320 

Tyr Tyr Asn Met Lys Asp Lys Glu Asn Ala lie Met lie Tyr Asp Lys 

325 330 335 

Gly Cys Lys Leu Gly Met Lys Gin Ala Cys Glu Asn Leu Thr Lys Leu 
340 345 350 

Arg Gly Tyr 
355 

(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



" (li) "MOLECUL"E~TYPE: protein 

(iii) HYPOTHETICAL: YES 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
25 (B) LOCATION 1...193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 

Met Lys Glu Lys Asn Phe Trp Pro Leu Gly lie Met Ser Val Leu lie 
30 1 5 10 15 

Phe Gly Leu Gly lie Val Val Phe Leu Val Val Phe Ala Leu Lys Asn 

20 25 30 

Ser Pro Lys Asn Asp Leu Val Tyr Phe Lys Gly His Asn Glu Val Asp 
35 40 45 

35 Leu Asn Phe Asn Ala Met Leu Lys Thr Tyr Glu Asn Phe Lys Ser Asn 
50 55 60 

Tyr Arg Phe Ser Val Gly Leu Lys Pro Leu Thr Glu Ser Pro Lys Thr 
65 70 75 80 

Pro lie Leu Pro Tyr Phe Ser Lys Gly Thr His Gly Asp Lys Lys He 
40 85 90 95 

Gin Glu Asn Leu Leu Asn Asn Ala Leu He Leu Glu Lys Ser Asn Thr 

100 105 110 

Leu Tyr Ala Gin Leu Gin Pro Leu Lys Pro Ala Leu Asp Ser Pro Asn 
115 120 125 

45 He Gin Val Tyr Leu Ala Phe Tyr Pro Ser Gin Ser Gin Pro Arg Leu 
130 135 140 

Leu Gly Thr Leu Asp Cys Lys Asn Ala Cys Glu Pro Leu Lys Phe Asp 
145 150 155 160 

Leu Leu Glu Gly Asp Lys Val Gly Arg Tyr Lys He Leu Phe Lys Phe 
50 165 170 175 

Val Phe Lys Asn Lys Glu Glu Leu He Leu Glu Gin Leu Ala Phe Phe 
180 185 190 

Lys 

55 
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(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 9 amino acids 
5 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECOLE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

15 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...289 



20 



25 



30 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



Leu 


Gly 


He 


Asn 


Met 


Cys 


Ser 


Lys 


Lys 


He 


Arg 


Asn 


Leu 


He 


Leu 


Cys 


1 






5 










10 










15 




Phe 


Gly 


Phe 


He 


Leu 


Ser 


Leu 


Cys 


Ala 


Glu 


Glu 


Asn 


He 


Thr 


Lys 


Glu 






20 










25 










30 






Asn 


Met 


Thr 
35 


Glu 


Thr 


Asn 


Thr 


Thr 
40 


Glu 


Glu 


Asn 


Thr 


Pro 


Lys 


Asp 


Ala 


Pro 


He 
50 


Leu 


Leu 


Glu 


Glu 


Lys 
55 


Arg 


Ala 


Gin 


Thr 


Leu 
60 


Glu 


Leu 


Lys 


Glu 


Glu 


Asn 


Glu 


Val 


Ala 


Lys 


Lys 


He 


Asp 


Glu 


Lys 


Ser 


Leu 


Leu 


Glu 


Glu 


65 










70 










75 










80 


He 


His 


Lys 


Lys 


Lys 
85 


Arg 


Gin 


Leu 


Tyr 


Met 
90 


Leu 


Lys 


Gly 


Glu 


Leu 
95 


His 


Glu 


Lys 


Asn 


Glu 


Ser 


He 


Leu 


Phe 


Gin 


Gin 


Met 


Ala 


Lys 


Asn 


Lys 


Ser 






100 










105 










110 






Gly 


Phe 


Phe 


He 


Gly 


Val 


He 


Leu 


Gly 


Asp 


He 


Gly 


He 


Asn 


Ala 


Asn 




115 










120 










125 








Pro 


Tyr 


Glu 


Lys 


Phe 


Glu 


Leu 


Leu 


Ser 


Asn 


He 


Gin 


Ala 


Ser 


Pro 


Leu 




130 










135 










140 








He 


Leu 


Tyr 


Gly 


Leu 


Arg 


Ser 


Gly 


Tyr 


Gin 


Lys 


Tyr 


Phe 


Ala 


Asn 


Gly 


145 








150 










155 










160 


Ser 


Ala 


Leu 


Arg 


Phe 
165 


Tyr 


Gly 


Glu 


Tyr 


Leu 
170 


Gly 


Gly 


Ala 


Met 


Lys 
175 


Gly 


Phe 


Lys 


Ser 


Asp 


Ser 


Leu 


Ala 


Ser 


Tyr 


Gin 


Thr 


Ala 


Ser 


Leu 


Asn 


He 






180 










185 










190 






Asp 


Leu 


Leu 


Met 


Asp 


Lys 


Pro 


He 


Asp 


Lys 


Glu 


Lys 


Arg 


Phe 


Ala 


Leu 




195 










200 










205 








Gly 


He 


Phe 


Gly 


Gly 


Val 


Gly Val 


Gly 


Trp 


Asn 


Gly 


Met 


Tyr 


Gin 


Asn 




210 










215 










220 










Leu 


Lys 


Glu 


He 


Arg 


Gly 


Tyr 


Ser 


Gin 


Pro 


Asn 


Ala 


Phe 


Gly 


Leu 


Val 


225 








230 










235 










240 


Leu 


Asn 


Leu 


Gly 


Val 


Ser 


Met 


Thr 


Leu 


Asn 


Leu 


Lys 


His 


Arg 


Phe 


Glu 








245 










250 










255 




Leu 


Ala 


Leu 


Lys 
260 


Met 


Pro 


Pro 


Leu 


Lys 
265 


Glu 


Thr 


Ser 


Gin 


Thr 
270 


Phe 


Leu 


Tyr 


Tyr 


Phe 


Lys 


Ser 


Thr 


Asn 


He 


Tyr 


Tyr 


He 


Ser 


Tyr 


Asn 


Tyr 


Leu 
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275 280 285 



Leu 



5 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 amino acids 

(B) TYPE: amino acid 
10 ( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

15 



Ivi") ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 
20 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

25 Met Arg Lys Leu Phe lie Pro Leu Leu Leu Phe Ser Ala Leu Glu Ala 
15 10 15 

Asn Glu Lys Asn Gly Phe Phe lie Glu Ala Gly Phe Glu Thr Gly Leu 

20 25 30 

Leu Glu Gly Thr Gin Thr Gin Glu Lys Arg His Thr Thr Thr Lys Asn 
30 35 40 45 

Thr Tyr Ala Thr Tyr Asn Tyr Leu Pro Thr Asp Thr lie Leu Lys Arg 

50 55 60 

Ala Ala Asn Leu Phe Thr Asn Ala Glu Ala lie Ser Lys Leu Lys Phe 
65 70 75 80 

35 Ser Ser Leu Ser Pro Val Arg Val Leu Tyr Met Tyr Asn Gly Gin Leu 

85 90 95 

Thr lie Glu Asn Phe Leu Pro Tyr Asn Leu Asn Asn Val Lys Leu Ser 

100 105 110 

Phe Thr Asp Ala Gin Gly Asn Thr lie Asp Leu Gly Val lie Glu Thr 
40 115 120 125 

lie Pro Lys His Ser Lys lie Val Leu Pro Gly Glu Ala Phe Asp Ser 

130 135 140 

Leu Lys Glu Ala Phe Asp Lys lie Asp Pro Tyr Thr Leu Phe Leu Pro 
145 150 155 160 

45 Lys Phe Glu Ala Thr Ser Thr Ser lie Ser Asp Thr Asn Thr Gin Arg 

165 170 175 

Val Phe Glu Thr Leu Asn Asn lie Lys Thr Asn Leu lie Met Lys Tyr 

180 185 190 

Ser Asn Glu Asn Pro Asn Asn Phe Asn Thr Cys Pro Tyr Asn Asn Asn 
50 195 200 205 

Gly Asn Thr Lys Asn Asp Cys Trp Gin Asn Phe Thr Pro Gin Thr Ala 

210 215 220 

Glu Glu Phe Thr Asn Leu Met Leu Asn Met lie Ala Val Leu Asp Ser 
225 230 235 240 

55 Gin Ser Trp Gly Asp Ala lie Leu Asn Ala Pro Phe Glu Phe Thr Asn 
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245 250 255 

Ser Ser Thr Asp Cys Asp Ser Asp Pro Ser Lys Cys Val Asn Pro Gly 

260 265 270 

Val Asn Gly Arg Val Asp Thr Lys Val Asp Gin Gin Tyr lie Leu Asn 
5 275 280 285 

Lys Gin Gly He He Asn Asn Phe Arg Lys Lys He Glu He Asp Ala 

290 295 300 

Val Val Leu Lys Asn Ser Gly Val Val Gly Leu Ala Asn Gly Tyr Gly 
305 3X0 315 320 

10 Asn Asp Gly Glu Tyr Gly Thr Leu Gly Val Glu Ala Tyr Ala Leu Asp 

325 330 335 

Pro Lys Lys Leu Phe Gly Asn Asp Leu Lys Thr He Asn Leu Glu Asp 

340 345 350 

Leu Arg Thr He Leu His Glu Phe Ser His Thr Lys Gly Tyr Gly His 
15 355 360 365 

Asn Gly Asn Met Thr Tyr Gin Arg Val Pro Val Thr Lys Asp Gly Gin 

370 375 380 

Val Glu Lys Asp Ser Asn Gly Lys Pro Lys Asp Ser Asp Gly Leu Pro 
385 390 395 400 

20 Tyr Asn Val Cys Ser Leu Tyr Gly Gly Ser Asn Gin Pro Ala Phe Pro 

405 410 415 

Ser Asn Tyr Pro Asn Ser He Tyr His Asn Cys Ala Asp Val Pro Ala 

420 425 430 

Gly Phe Leu Gly Val Thr Ala Ala Val Trp Gin Gin Leu He Asn Gin 
25 435 440 445 

Asn Ala Leu Pro He Asn Tyr Ala Asn Leu Gly Ser Gin Thr Asn Tyr 

450 455 460 

Asn Leu Asn Ala Ser Leu Asn Thr Gin Asp Leu Ala Asn Ser Met Leu 
465 470 475 480 

30 ser Thr He Gin Lys Thr Phe Val Thr Ser Ser Val Thr Asn His His 

485 490 495 

Phe ser Asn Ala Ser Gin Ser Phe Arg Ser Pro He Leu Gly Val Asn 

500 505 510 

Ala Lys He Gly Tyr Gin Asn Tyr Phe Asn Asp Phe He Gly Leu Ala 
35 515 520 525 

Tyr Tyr Gly He He Lys Tyr Asn Tyr Ala Lys Ala Val Asn Gin Lys 

530 535 540 

Val Gin Gin Leu Ser Tyr Gly Gly Gly He Asp Leu Leu Leu Asp Phe 
545 550 555 560 

40 He Thr Thr Tyr Ser Asn Lys Asn Ser Pro Thr Gly He Gin Thr Lys 

565 570 575 

Arg Asn Phe Ser Ser Ser Phe Gly He Phe Gly Gly Leu Arg Gly Leu 

580 585 590 

Tyr Asn Ser Tyr Tyr Val Leu Asn Lys Val Lys Gly Ser Gly Asn Leu 
45 595 600 605 

Asp Val Ala Thr Gly Leu Asn Tyr Arg Tyr Lys His Ser Lys Tyr Ser 

610 615 620 

Val Gly He Ser He Pro Leu He Gin Arg Lys Ala Ser Val Val Ser 
625 630 635 640 

50 Ser Gly Gly Asp Tyr Thr Asn Ser Phe Val Phe Asn Glu Gly Ala Ser 

645 650 655 

His Phe Lys Val Phe Phe Asn Tyr Gly Trp Val Phe 
660 665 

55 (2) INFORMATION FOR SEQ ID NO: 109: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



( ix ) FEATURE : 
15 (A) NAME/KEY: misc_feature 

{-E) -LGGAT-IGN— 1- — 6-3 - - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: 

20 Met Asn Thr Glu lie Leu Thr lie Met Leu Val Val Ser Val Leu Met 
15 10 15 

Gly Leu Val Gly Leu lie Ala Phe Leu Trp Gly Val Lys Ser Gly Gin 

20 25 30 

Phe Asp Asp Glu Lys Arg Met Leu Glu Ser Val Leu Tyr Asp Ser Ala 
25 35 40 45 

Ser Asp Leu Asn Glu Ala lie Leu Gin Glu Lys Arg Gin Lys Asn 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 06 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL; YES 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
45 (B) LOCATION 1.. .406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Met Val Phe Phe His Lys Lys lie lie Leu Asn Phe lie Tyr Ser Leu 

50 1 5 10 15 

Met Val Ala Phe Leu Phe His Leu Ser Tyr Gly Val Leu Leu Lys Ala 

20 25 30 

Asp Gly Met Ala Lys Lys Gin Thr Leu Leu Val Gly Glu Arg Leu Val 

35 40 45 

55 Trp Asp Lys Leu Thr Leu Leu Gly Phe Leu. Glu Lys Asn His lie Pro 
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50 55 60 

Gin Lvs Leu Tyr Tyr Asn Leu Ser Ser Gin Asp Lys Glu Leu Ser Ala 
65 70 75 80 

Glu lie Gin Ser Asn Val Thr Tyr Tyr Thr Leu Arg Asp Ala Asn Asn 
5 85 90 95 

Thr Leu He Gin Ala Leu He Pro He Ser Gin Asp Leu Gin He His 

100 105 110 

He Tyr Lys Lys Gly Glu Asp Tyr Phe Leu Asp Phe He Pro He Val 
115 120 125 

10 Phe Thr Arg Lys Glu Arg Thr Leu Leu Leu Ser Leu Gin Thr Ser Pro 
130 135 140 

Tyr Gin Asp He Val Lys Ala Thr Asn Asp Pro Leu Leu Ala Asn Gin 
145 150 155 160 

Leu Met Asn Ala Tyr Lys Lys Ser Val Pro Phe Lys Arg Leu Val Lys 
15 165 170 175 

Asn Asp Lys He Ala He Val Tyr Thr Arg Asp Tyr Arg Val Gly Gin 

180 185 190 

Ala Phe Gly Gin Pro Thr He Lys Met Ala Met Val Ser Ser Arg Leu 
195 200 205 

20 His Gin Tyr Tyr Leu Phe Ser His Ser Asn Gly Arg Tyr Tyr Asp Ser 
210 215 220 

Lys Ala Gin Glu Val Ala Gly Phe Leu Leu Glu Thr Pro Val Lys Tyr 
225 230 235 240 

Thr Arg He Ser Ser Pro Phe Ser Tyr Gly Arg Phe His Pro Val Leu 
25 245 250 255 

Lys Val Lys Arg Pro His Tyr Gly Val Asp Tyr Ala Ala Lys His Gly 

260 265 270 

Ser Leu He His Ser Ala Ser Asp Gly Arg Val Gly Phe He Gly Val 
275 280 285 

30 Lys Ala Gly Tyr Gly Lys Val Val Glu He His Leu Asn Glu Leu Arg 
290 295 300 

Leu Val Tyr Ala His Met Ser Ala Phe Ala Asn Gly Leu Lys Lys Gly 
305 310 315 320 

Ser Phe Val Lys Lys Gly Gin He He Gly Arg Val Gly Ser Thr Gly 
35 325 330 335 

Leu Ser Thr Gly Pro His Leu His Phe Gly Val Tyr Lys Asn Ser Arg 

340 345 350 

Pro He Asn Pro Leu Gly Tyr He Arg Thr Ala Lys Ser Lys Leu His 
355 360 365 

40 Gly Lys Gin Arg Glu Val Phe Leu Glu Lys Ala Gin Tyr Ser Lys Gin 
370 375 380 

Lys Leu Glu Glu Leu Phe Lys Thr His Ser Phe Glu Lys Asn Ser Phe 
385 390 395 400 

Tyr Leu Leu Glu Gly Phe 
45 405 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 2 96 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



55 



(ii) MOLECULE TYPE: protein 
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( i i i ) HYPOTHET I CAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

5 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...296 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Leu Phe Leu Val Lys Lys lie Gly Val Val He Met He Leu Val Cys 
15 10 15 

Phe Leu Ala Cys Ser Gin Glu Ser Phe He Lys Met Gin Lys Lys Ala 
15 20 25 30 



" G In G 1 u"~G 1 n~G 1 u~ "^AsrT^Asp G ly^s er^Lys^'Arg^ Pro "Ser Tyr Val Asp Ser 
35 40 45 

Asp Tyr Glu Val Phe Ser Glu Thr He Phe Leu Gin Asn Met Val Tyr 
50 55 60 

20 Gin Pro He Glu Glu Arg Asn Ala Phe Phe Gin Leu Thr Lys Asp Glu 
65 70 75 80 

Asp Asn Ser Phe Asn Pro Glu Asn Ser Val He Leu Leu Asn Glu Pro 

85 90 95 

Ser Asp Asn Ser Glu Lys Asn Leu Leu Ser Tyr Pro Asn Asp Pro Asn 
25 100 105 110 

Asn Asn Glu Asp Asn Ala Asn Asn Ser Gin Lys Asn Pro Phe Leu Tyr 

115 120 125 

Lys Pro Lys Arg Lys Thr Lys Asn Pro Lys Leu He Glu Tyr Ser Gin 
130 135 140 

30 Gin Asp Phe Tyr Pro Leu Lys Asn Gly Asp He He Met Ser Lys Glu 
145 150 155 160 

Gly Asp Gin Trp Leu He Glu He Gin Ser Lys Ala Leu Lys Arg Phe 

165 170 175 

Leu Lys Asp Gin Asn Asp Lys Asp Arg Gin He Gin Thr Phe Thr Phe 
35 180 185 190 

Asn Asp Thr Lys Thr Gin He Ala Gin He Lys Gly Lys He Ser Ser 

195 200 205 

Tyr Val Tyr Thr Thr Asn Asn Gly Ser Leu Ser Leu Arg Pro Phe Tyr 
210 215 220 

40 Glu Ser Phe Leu Leu Glu Lys Lys Ser Asp Asn Val Tyr Thr He Glu 
225 230 235 240 

Asn Lys Ala Leu Asp Thr Met Glu He Ser Lys Cys Gin Met Val Leu 

245 250 255 

Lys Lys His Ser Thr Asp Lys Leu Asp Ser Gin His Lys Ala He Ser 
45 260 265 270 

He Asp Leu Asp Phe Lys Lys Glu Arg Phe Lys Ser Asp Thr Glu Leu 

275 280 285 

Phe Leu Glu Cys Leu Lys Glu Ser 
290 295 



50 



(2) INFORMATION FOR SEQ ID NO: 112: 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 8 amino acids 

(B) TYPE: amino acid 



wo 98/18323 



PCT/US97/19575 



-186- 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

5 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

10 (ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1...248 



15 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:% 



Val Ser Tyr Asp Asn Thr Asp Asp Tyr Tyr Phe Pro Arg Asn Gly Val 
15 10 15 

lie Phe Ser Ser Tyr Ala Thr Met Ser Gly Leu Pro Ser Ser Gly Thr 
20 25 30 

20 Leu Asn Ser Trp Asn Gly Leu Gly Gly Asn Val Arg Asn Thr Lys Val 
35 40 45 

Tyr Gly Lys Phe Ala Ala Tyr His His Leu Gin Lys Tyr Leu Leu lie 

50 55 60 

Asp Leu lie Ala Arg Phe Lys Thr Gin Gly Gly Tyr lie Phe Arg Tyr 
25 65 70 75 80 

Asn Thr Asp Asp Tyr Leu Pro Leu Asn Ser Thr Phe Tyr Met Gly Gly 

85 90 95 

Val Thr Thr Val Arg Gly Phe Arg Asn Gly Ser lie Thr Pro Lys Asp 
100 105 110 

30 Glu Phe Gly Leu Trp Leu Gly Gly Asp Gly lie Phe Thr Ala Ser Thr 
115 120 125 

Glu Leu Ser Tyr Gly Val Leu Lys Ala Ala Lys Met Arg Leu Ala Trp 

130 135 140 

Phe Phe Asp Phe Gly Phe Leu Thr Phe Lys Thr Pro Thr Arg Gly Ser 
35 145 150 155 160 

Phe Phe Tyr Asn Ala Pro Thr Thr Thr Ala Asn Phe Lys Asp Tyr Gly 

165 170 175 

Val Val Gly Ala Gly Phe Glu Arg Ala Thr Trp Arg Ala Ser Thr Gly 
180 185 190 

40 Leu Gin lie Glu Trp lie Ser Pro Met Gly Pro Leu Val Leu He Phe 
195 200 205 

Pro He Ala Phe Phe Asn Gin Trp Gly Asp Gly Asn Gly Lys Lys Cys 

210 215 220 

Lys Gly Leu Cys Phe Asn Pro Asn Met Asn Asp Tyr Thr Gin His Phe 
45 225 230 . 235 240 

Glu Phe Ser Met Gly Thr Arg Phe 
245 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
10 (B) LOCATION 1...335 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:113: 

Val Gin His Phe Asn Phe Leu Tyr Lys Asp Ser Leu Phe Ser lie Ala 
15 1 5 10 15 



— Leu Phe— Thr— Phe" "He' I I'e' Ala^Leu" Va 1" I le" Leu Leu Glu Glri TAlaT Ar g 
20 25 30 

Ala Tyr Phe Thr Arg Lys Arg Asn Lys Lys Phe Leu Gin Lys Phe Ala 
35 40 45 

20 Gin Asn Gin Asn Ala Tyr Ala Ser Ser Glu Asn Leu Asp Glu Leu Leu 
50 55 60 

Lys His Ala Lys lie Ser Ser Leu Met Phe Leu Ala Arg Ala Tyr Ser 
65 70 75 80 

Lys Ala Asp Val Glu Met Ser lie Glu lie Leu Lys Gly Leu Leu Asn 
25 85 90 95 

Arg Pro Leu Lys Asp Glu Glu Lys lie Ala Val Leu Asp Leu Leu Ala 

100 105 110 

Lys Asn Tyr Phe Ser Val Gly Tyr Leu Gin Lys Thr Lys Asp Thr Val 
115 120 125 

30 Lys Glu He Leu Arg Phe Ser Pro Arg Asn Val Glu Ala Leu Leu Lys 
130 135 140 

Leu Leu His Ala Tyr Glu Leu Glu Lys Asp Tyr Ser Lys Ala Leu Glu 
145 150 155 160 

Thr Leu Glu Cys Leu Glu Glu Leu Glu Val Pro Lys He Glu Thr He 
35 165 170 175 

Lys Asn Tyr Leu Tyr Leu Met His Leu lie Glu Asn Lys Glu Asp Ala 

180 185 190 

Ala Lys He Leu His Val Ser Lys Ala Ser Leu Asp Leu Lys Lys He 
195 200 205 

40 Ala Leu Asn His Leu Lys Ser His Asp Glu Asn Leu Phe Trp Gin Glu 
210 215 220 

He Asp Thr Thr Glu Arg Leu Glu Asn Val He Asp Leu Leu Trp Asp 
225 230 235 240 

Met Asn He Pro Ala Phe He Leu Glu Lys His Ala Leu Leu Gin Asp 
45 245 250 255 

He Ala Arg Ser Gin Gly Leu Leu Leu Asp His Lys Pro Cys Gin He 

260 265 270 

Phe Glu Leu Glu Val Leu Arg Ala Leu Leu His Ser Pro He Lys Ala 
275 280 285 

50 Ser Leu Thr Phe Glu Tyr Arg Cys Lys His Cys Lys Gin He Phe Pro 
290 295 300 

Phe Glu Ser His Arg Cys Pro Val Cys Tyr Gin Leu Ala Phe Met Asp 
305 310 315 320 

Met Val Leu Lys He Ser Lys Lys Thr His Ala Met Gly Val Asp 
55 325 330 335 
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10 



15 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .413 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Met Arg Lys lie Phe Ser Tyr lie Ser Lys Val Leu Leu Phe lie Gly 
1 5 10 15 

Val Val Tyr Ala Glu Pro Asp Ser Lys Val Glu Ala Leu Glu Gly Arg 
25 20 25 30 

Lys Gin Glu Ser Ser Leu Asp Lys Lys He Arg Gin Glu Leu Lys Ser 

35 40 45 

Lys Glu Leu Lys Asn Lys Glu Leu Lys Asn Lys Asp Leu Lys Asn Lys 
50 55 60 

30 Glu Glu Lys Lys Glu Thr Lys Ala Lys Arg Lys Pro Arg Ala Glu Val 
65 70 75 80 

His His Gly Asp Ala Lys Asn Pro Thr Pro Lys He Thr Pro Pro Lys 

85 90 95 

He Lys Gly Ser Ser Lys Gly Val Gin Asn Gin Gly Val Gin Asn Asn 
35 100 105 110 

Ala Pro Lys Pro Glu Glu Lys Asp Thr Thr Pro Gin Ala Thr Glu Lys 

115 120 125 

Asn Lys Glu Thr Ser Pro Ser Ser Gin Phe Asn Ser He Phe Gly Asn 
130 135 140 

40 Pro Asn Asn Ala Thr Asn Asn Thr Leu Glu Asp Lys Val Val Gly Gly 
145 150 155 160 

He Ser Leu Leu Val Asn Gly Ser Pro He Thr Leu Tyr Gin He Gin 

165 170 175 

Glu Glu Gin Glu Lys Ser Lys Val Ser Lys Ala Gin Ala Arg Asp Arg 
45 ISO 185 190 

Leu He Ala Glu Arg He Lys Asn Gin Glu He Glu Arg Leu Lys He 

195 200 205 

His Val Asp Asp Asp Lys Leu Asp Gin Glu Met Ala Met Met Ala Gin 
210 215 220 

50 Gin Gin Gly Met Asp Leu Asp His Phe Lys Gin Met Leu Met Ala Glu 
225 230 235 240 

Gly His Tyr Lys Leu Tyr Arg Asp Gin Leu Lys Glu His Leu Glu Met 

245 250 255 

Gin Glu Leu Leu Arg Asn He Leu Leu Thr Asn Val Asp Thr Ser Ser 
55 260 265 270 
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Glu Ttir Lys Met Arg Glu Tyr Tyr Asn Lys His Lys Glu Gin Phe Ser 

275 280 285 

lie Pro Thr Glu lie Glu Thr Val Arg Tyr Thr Ser Thr Asn Gin Glu 
290 295 300 

5 Asp Leu Glu Arg Ala Met Ala Asp Pro Asn Leu Glu Val Pro Gly Val 
305 310 315 320 

Ser Lys Ala Asn Glu Lys lie Glu Met Lys Thr Leu Asn Pro Gin lie 

325 330 335 

Ala Gin Val Phe lie Ser His Glu Gin Gly Ser Phe Thr Pro Val Met 
10 340 345 350 

Asn Gly Gly Gly Gly Gin Phe lie Thr Phe Tyr lie Lys Glu Lys Arg 

355 360 365 

Gly Lys Asn Glu Val Ser Phe Ser Gin Ala Lys Gin Phe lie Ala Gin 
370 375 380 

15 Lys Leu Val Glu Glu Ser Lys Asp Lys lie Leu Glu Glu His Phe Glu 
3T5 3^90^ " 395 '~ 400 

Lys Leu Arg Val Lys Ser Arg lie Val Met lie Arg Glu 
405 410 

20 (2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
35 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: 

40 Met lie Lys Arg lie Ala Cys lie Leu Ser Leu Ser Ala Ser Leu Ala 
15 10 15 

Leu Ala Gly Glu Val Asn Gly Phe Phe Met Gly Ala Gly Tyr Gin Gin 

20 25 30 

Gly Arg Tyr Gly Pro Tyr Asn Ser Asn Tyr Ser Asp Trp Arg His Gly 
45 35 40 45 

Asn Asp Leu Tyr Gly Leu Asn Phe Lys Leu Gly Phe Val Gly Phe Ala 

50 55 60 

Asn Lys Trp Phe Gly Ala Arg Val Tyr Gly Phe Leu Asp Trp Phe Asn 
65 70 75 80 

50 Thr Ser Gly Thr Glu His Thr Lys Thr Asn Leu Leu Thr Tyr Gly Gly 

85 90 95 

Gly Gly Asp Leu lie Val Asn Leu lie Pro Leu Asp Lys Phe Ala Leu 

100 105 110 

Gly Leu lie Gly Gly Val Gin Leu Ala Gly Asn Thr Trp Met Phe Pro 
55 115 120 125 
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Tyr Asp Val Asn Gin Thr Arg Phe Gin Phe Leu Trp Asn Leu Gly Gly 

130 135 140 

Arg Met Arg Val Gly Asp Arg Ser Ala Phe Glu Ala Gly Val Lys Phe 
145 150 155 160 

5 pro Met Val Asn Gin Gly Ser Lys Asp Val Gly Leu He Arg Tyr Tyr 

165 170 175 

Ser Trp Tyr Val Asp Tyr Val Phe Thr Phe 
180 185 

10 (2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 



(vi) 0RIGIN;U^ SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
25 (A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .242 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

30 Met Lys Lys Phe Phe Ser Gin Ser Leu Leu Ala Leu lie lie Ser Met 
15 10 15 

Asn Ala Val Ser Gly Met Asp Gly Asn Gly Val Phe Leu Gly Ala Gly 

20 25 30 

Tyr Leu Gin Gly Gin Ala Gin Met His Ala Asp lie Asn Ser Gin Lys 
35 35 40 45 

Gin Ala Thr Asn Ala Thr lie Lys Gly Phe Asp Ala Leu Leu Gly Tyr 

50 55 60 

Gin Phe Phe Phe Glu Lys His Phe Gly Leu Arg Leu Tyr Gly Phe Phe 
65 70 75 80 

40 Asp Tyr Ala His Ala Asn Ser lie Lys Leu Lys Asn Pro Asn Tyr Asn 

85 90 95 

Ser Glu Ala Ala Gin Val Ala Ser Gin lie Leu Gly Lys Gin Glu lie 

100 105 110 

Asn Arg Leu Thr Asn lie Ala Asp Pro Arg Thr Phe Glu Pro Asn Met 
45 115 120 125 

Leu Thr Tyr Gly Gly Ala Met Asp Val Met Val Asn Val lie Asn Asn 

130 135 140 

Gly lie Met Ser Leu Gly Ala Phe Gly Gly lie Gin Leu Ala Gly Asn 
145 150 155 160 

50 Ser Trp Leu Met Ala Thr Pro Ser Phe Glu Gly lie Leu Val Glu Gin 

165 170 175 

Ala Leu Val Ser Lys Lys Ala Thr Ser Phe Gin Phe Leu Phe Asn Val 

180 185 190 

Gly Ala Arg Leu Arg lie Leu Lys His Ser Ser lie Glu Ala Gly Val 
55 195 200 205 
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Lys Phe Pro Met Leu Lys Lys Asn Pro Tyr lie Thr Ala Lys Asn Leu 

210 215 220 

Asp lie Gly Phe Arg Arg Val Tyr Ser Trp Tyr Val Asn Tyr Val Phe 
225 230 235 240 

5 Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 117: 



10 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

15 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION 1...256 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 





Met 


Gly 


Tyr 


Ala 


Ser 


Lys 


Leu 


Ala 


Leu 


Lys 


He 


Cys 


Leu 


Val 


Gly 


Leu 




1 








5 










10 










15 




30 


Cys 


Leu 


Phe 


Ser 
20 


Thr 


Leu 


Gly 


Ala 


Glu 
25 


His 


Leu 


Glu 


Gin 


Lys 
30 


Gly 


Asn 




Tyr 


He 


Tyr 
35 


Lys 


Gly 


Glu 


Glu 


Ala 
40 


Tyr 


Asn 


Asn 


Lys 


Glu 
45 


Tyr 


Glu 


Arg 




Ala 


Ala 


Ser 


Phe 


Tyr 


Lys 


Ser 


Ala 


He 


Lys 


Asn 


Gly 


Glu 


Ser 


Leu 


Ala 


35 




50 










55 










60 












Tyr 


He 


Leu 


Leu 


Gly 


He 


Met 


Tyr 


Glu 


Asn 


Gly Arg 


Gly Val 


Pro 


Lys 




65 










70 










75 










80 




-Asp 


Tyr 


Lys 


Lys 


Ala 
85 


Val 


Glu 


Tyr 


Phe 


Gin 
90 


Lys 


Ala 


Val 


Asp 


Asn 
95 


Asp 


40 


lie 


Pro 


Arg 


Gly 


Tyr 


Asn 


Asn 


Leu 


Gly Val 


Met 


Tyr 


Lys 


Glu 


Gly 


Lys 










100 










105 










110 








Gly Val 


Pro 


Lys 


Asp 


Glu 


Lys 


Lys 


Ala 


Val 


Glu 


Tyr 


Phe 


Arg 


He 


Ala 








115 










120 










125 








45 


Thr 


Glu 


Lys 


Gly Tyr 


Thr 


Asn 


Ala 


Tyr 


He 


Asn 


Leu 


Gly 


He 


Met 


Tyr 




130 










13 5 










140 












Met 


Glu 


Gly 


Arg 


Gly 


Val 


Pro 


Ser 


Asn 


Tyr 


Ala 


Lys 


Ala 


Thr 


Glu 


Cys 




145 










150 










155 










160 




Phe 


Arg 


Lys 


Ala 


Met 


His 


Lys 


Gly Asn 


Val 


Glu 


Ala 


Tyr 


He 


Leu 


Leu 












165 










170 










175 




50 


Gly Asp 


He 


Tyr 


Tyr 


Ser 


Gly 


Asn 


Asp 


Gin 


Leu 


Gly 


He 


Glu 


Pro 


Asp 










180 










185 










190 








Lys 


Asp 


Lys 
195 


Ala 


Val 


Val 


Tyr 


Tyr 
200 


Lys 


Met 


Ala 


Ala 


Asp 
205 


Val 


Ser 


Ser 


55 


Ser 


Arg 


Ala 


Tyr 


Glu 


Gly 


Leu 


Ser 


Glu 


Ser 


Tyr 


Arg 


Tyr 


Gly 


Leu 


Gly 




210 










215 










220 
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Val Glu Lys Asp Lys Lys Lys Ala Glu Glu Tyr Met Gin Lys Ala Cys 
225 230 235 240 

Asp Phe Asp lie Asp Lys Asn Cys Lys Lys Lys Asn Thr Ser Ser Arg 
245 250 255 

5 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 
10 (B) TYPE: amino acid 

{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

20 (ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1. . .657 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 





Met 


Arg 


Lys 


Leu 


Phe 


He 


Pro 


Leu 


Leu 


Leu 


Phe 


Ser 


Ala 


Leu 


Glu 


Ala 




1 




5 










10 










15 






Asn 


Glu 


Lys 


Asn 


Gly 


Phe 


Phe 


He 


Glu 


Ala 


Gly 


Phe 


Glu 


Thr 


Gly 


Leu 








20 










25 










30 






30 


Leu 


Glu 


Gly 


Thr 


Gin 


Thr 


Gin 


Glu 


Lys 


Arg 


His 


Thr 


Thr 


Thr 


Lys 


Asn 






35 










40 










45 










Thr 


Tyr 
50 


Ala 


Thr 


Tyr 


Asn 


Tyr 
55 


Leu 


Pro 


Thr 


Asp 


Thr 
60 


He 


Leu 


Lys 


Arg 




Ala 


Ala 


Asn 


Leu 


Phe 


Thr 


Asn 


Ala 


Glu 


Ala 


He 


Ser 


Lys 


Leu 


Lys 


Phe 


35 


65 










70 










75 








Gin 


80 




Ser 


Ser 


Leu 


Ser 


Pro 


Val 


Arg 


Val 


Leu 


Tyr 


Met 


Tyr 


Asn 


Gly 


Leu 










85 










90 










95 






Thr 


He 


Glu 


Asn 
100 


Phe 


Leu 


Pro 


Tyr 


Asn 
105 


Leu 


Asn 


Asn 


Val 


Lys 
110 


Leu 


Ser 


40 


Phe 


Thr 


Asp 


Ala 


Gin 


Gly 


Asn 


Val 


He 


Asp 


Leu 


Gly Val 


He 


Glu 


Thr 








115 










120 










125 










He 


Pro 


Lys 


His 


Ser 


Lys 


He 


Val 


Leu 


Pro 


Gly 


Glu 


Ala 


Phe 


Asp 


Ser 






130 








135 










14 0 








Thr 




Leu 


Lys 


He 


Asp 


Pro 


Tyr 


Thr 


Leu 


Phe 


Leu 


Pro 


Lys 


He 


Glu 


Ala 


45 


145 








150 










155 










160 




Ser 


Thr 


Ser 


He 


Ser 
165 


Asp 


Ala 


Asn 


Thr 


Gin 
170 


Arg 


Val 


Phe 


Glu 


Thr 
175 


Leu 




Asn 


Lys 


He 


Lys 


Thr 


Asn 


Leu 


Val 


Val 


Asn 


Tyr 


Arg 


Asn 


Glu 


Asn 


Lys 








180 










185 










190 




Ala 


50 


Phe 


Lys 


Asp 


His 


Glu 


Asn 


His 


Trp 


Glu 


Ala 


Phe 


Thr 


Pro 


Gin 


Thr 






195 










200 










205 










Glu 


Glu 
210 


Phe 


Thr 


Asn 


Leu 


Met 
215 


Leu 


Asn 


Met 


He 


Ala 
220 


Val 


Leu 


Asp 


Ser 




Gin 


Ser 


Trp 


Gly 


Asp 


Ala 


He 


Leu 


Asn 


Ala 


Pro 


Phe 


Glu 


Phe 


Thr 


Asn 


55 


225 










230 










235 










240 
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Ser Pro Thr Asp Cys Asp Asn Asp Pro Ser Lys Cys Val Asn Pro Gly 

245 250 255 

Thr Asn Gly Leu Val Asn Ser Lys Val Asp Gin Lys Tyr Val Leu Asn 
260 265 270 

5 Lys Gin Asp lie Val Asn Lys Phe Lys Asn Lys Ala Asp Leu Asp Val 
275 280 285 

lie Val Leu Lys Asp Ser Gly Val Val Gly Leu Gly Ser Asp lie Thr 

290 295 300 

Pro Ser Asn Asn Asp Asp Gly Lys His Tyr Gly Gin Leu Gly Val Val 
10 305 310 315 320 

Ala Ser Ala Leu Asp Pro Lys Lys Leu Phe Gly Asp Asn Leu Lys Thr 

325 330 335 

lie Asn Leu Glu Asp Leu Arg Thr lie Leu His Glu Phe Ser His Thr 
340 345 350 

1 5 L ys G l y Tyr Gl y His Asn Gl y Asn Met Thr Tyr Gin Ar g Val _Pr o_ J/a 1 
355 360 365 

Thr Lys Asp Gly Gin Val Glu Lys Asp Ser Asn Gly Lys Pro Lys Asp 

370 375 380 

Ser Asp Gly Leu Pro Tyr Asn Val Cys Ser Leu Tyr Gly Gly Ser Asn 
20 385 390 395 400 

Gin Pro Ala Phe Pro Ser Asn Tyr Pro Asn Ser lie Tyr His Asn Cys 

405 410 415 

Ala Asp Val Pro Ala Gly Phe Leu Gly Val Thr Ala Ala Val Trp Gin 
420 425 430 

25 Gin Leu lie Asn Gin Asn Ala Leu Pro lie Asn Tyr Ala Asn Leu Gly 
435 440 445 

Ser Gin Thr Asn Tyr Asn Leu Asn Ala Ser Leu Asn Thr Gin Asp Leu 

450 455 460 

Ala Asn Ser Met Leu Ser Thr lie Gin Lys Thr Phe Val Thr Ser Ser 
30 465 470 475 480 

Val Thr Asn His His Phe Ser Asn Ala Ser Gin Ser Phe Arg Ser Pro 

485 490 495 

lie Leu Gly Val Asn Ala Lys lie Gly Tyr Gin Asn Tyr Phe Asn Asp 
500 505 510 

35 Phe lie Gly Leu Ala Tyr Tyr Gly lie lie Lys Tyr Asn Tyr Ala Lys 
515 520 525 

Ala Val Asn Gin Lys Val Gin Gin Leu Ser Tyr Gly Gly Gly lie Asp 

530 535 540 

Leu Leu Leu Asp Phe lie Thr Thr Tyr Ser Asn Lys Asn Ser Pro Thr 
40 545 550 555 560 

Gly lie Gin Thr Lys Arg Asn Phe Ser Ser Ser Phe Gly lie Phe Gly 

565 570 575 

Gly Leu Arg Gly Leu Tyr Asn Ser Tyr Tyr Val Leu Asn Lys Val Lys 
580 585 590 

45 Gly Ser Gly Asn Leu Asp Val Ala Thr Gly Leu Asn Tyr Arg Tyr Lys 
595 600 605 

His Ser Lys Tyr Ser Val Gly He Ser He Pro Leu lie Gin Arg Lys 

610 615 620 

Ala Ser Val Val Ser Ser Gly Gly Asp Tyr Thr Asn Ser Phe Val Phe 
50 625 630 635 640 

Asn Glu Gly Ala Ser His Phe Lys Val Phe Phe Asn Tyr Gly Trp Val 
645 650 655 

Phe 



55 
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(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 167 amino acids 
5 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

15 (ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...167 



20 



45 



50 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 119 



Met Lys Leu Val Ser Leu lie Val Ala Leu Val Phe Cys Cys Phe Leu 
1 5 10 15 

Gly Ala Val Glu Leu Pro Gly Val Tyr Gin Thr Gin Glu Phe Leu Tyr 
20 25 30 

25 Met Lys Ser Ser Phe Val Glu Phe Phe Glu His Asn Gly Lys Phe Tyr 
35 40 45 

Ala Tyr Gly lie Ser Asp Val Asp Gly Ser Lys Ala Lys Lys Asp Lys 

50 55 60 

Leu Asn Pro Asn Pro Lys Leu Arg Asn Arg Ser Asp Lys Gly Val Val 
30 65 70 75 80 

Phe Leu Ser Asp Leu He Lys Val Gly Glu Gin Ser Tyr Lys Gly Gly 

85 90 95 

Lys Ala Tyr Asn Phe Tyr Asp Gly Lys Thr Tyr His Val Arg Val Thr 
100 105 110 

35 Gin Asn Ser Asn Gly Asp Leu Glu Phe Thr Ser Ser Tyr Asp Lys Trp 
lis 120 125 

Gly Tyr Val Gly Lys Thr Phe Thr Trp Lys Arg Leu Ser Asp Glu Glu 

130 135 140 

He Lys Asn Leu Lys Leu Lys Arg Phe Asn Leu Asp Glu Val Leu Lys 
40 145 150 155 160 

Thr Leu Lys Asp Ser Pro He 
165 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
55 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME/KEY: misc_feature 
5 (B) LOCATION 1...294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Met Ser Asn Gin Ala Ser His Leu Asp Asn Phe Met Asn Ala Lys Asn 
10 1 5 10 15 

Pro Lys Ser Phe Phe Asp Asn Lys Gly Asn Thr Lys Phe lie Ala lie 

20 25 30 

Thr Ser Gly Lys Gly Gly Val Gly Lys Ser Asn lie Ser Ala Asn Leu 
35 40 45 

15 Ala Tyr Ser Leu Tyr Lys Lys Gly Tyr Lys Val Gly Val Phe Asp A la 
50 55 60 

Asp lie Gly Leu Ala Asn Leu Asp Val lie Phe Gly Val Lys Thr His 
65 70 75 80 

Lys Asn lie Leu His Ala Leu Lys Gly Glu Ala Lys Leu Gin Glu lie 
20 85 90 95 

lie Cys Glu lie Glu Pro Gly Leu Cys Leu lie Pro Gly Asp Ser Gly 

100 105 110 

Glu Glu lie Leu Lys Tyr lie Ser Gly Ala Glu Ala Leu Asp Arg Phe 
115 120 125 

25 Val Asp Glu Glu Gly Val Leu Ser Ser Leu Asp Tyr lie Val lie Asp 
130 135 140 

Thr Gly Ala Gly He Gly Ala Thr Thr Gin Ala Phe Leu Asn Ala Ser 
145 150 155 160 

Asp Cys Val Val He Val Thr Thr Pro Asp Pro Ser Ala He Thr Asp 
30 165 170 175 

Ala Tyr Ala Cys He Lys He Asn Ser Lys Asn Lys Asp Glu Leu Phe 

180 185 190 

Leu He Ala Asn Met Val Ala Gin Pro Lys Glu Gly Arg Ala Thr Tyr 
195 200 205 

35 Glu Arg Leu Phe Lys Val Ala Lys Asn Asn He Ala Ser Leu Glu Leu 
210 215 220 

His Tyr Leu Gly Ala He Glu Asn Ser Ser Leu Leu Lys Arg Tyr Val 
225 230 235 240 

Arg Glu Arg Lys He Leu Arg Lys He Ala Pro Asn Asp Leu Phe Ser 
40 245 250 255 

Gin Ser He Asp Gin He Ala Ser Leu Leu Val Ser Lys Leu Glu Thr 

260 265 270 

Gly Thr Leu Glu He Pro Lys Glu Gly Leu Lys Ser Phe Phe Lys Arg 
275 280 285 

45 Leu Leu Lys Tyr Leu Gly . 
290 

(2) INFORMATION FOR SEQ ID NO: 121: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 72 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



55 



(ii) MOLECULE TYPE: protein 
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10 



(iii) HYPOTHETICAIi: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATtJRE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION l..,372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 



Leu Glu Pro Ser Arg Asn Arg Leu Lys His Ala Ala Phe Phe Val Gly 
1 5 10 15 

15 Leu Phe lie Val Leu Phe Leu lie lie Met Lys His Gin Thr Ser Pro 
20 25 30 

Tyr Ala Phe Thr His Asn Gin Ala Leu Val Thr Gin Thr Pro Pro Tyr 

35 40 45 

Phe Thr Gin Leu Thr lie Pro Lys Pro Asn Asp Ala Leu Ser Ala His 
20 50 55 60 

Ala Ser Ser Leu lie Ser Leu Pro Asn Asp Asn Leu Leu Ser Ala Tyr 
65 70 75 80 

Phe Ser Gly Thr Lys Glu Gly Ala Arg Asp Val Lys He Ser Ala Asn 
85 90 95 

25 Leu Phe Asp Ser Lys Thr Asn Arg Trp Ser Glu Ala Phe He Leu Leu 
100 105 110 

Thr Lys Glu Glu Leu Ser His His Ser His Glu Tyr He Lys Lys Leu 

115 120 125 

Gly Asn Pro Leu Leu Phe Leu His Asp Asn Lys He Leu Leu Phe Val 
30 130 135 140 

Val Gly Val Ser Met Gly Gly Trp Ala Thr Ser Lys He Tyr Gin Phe 
145 150 155 160 

Glu Ser Ala Leu Glu Pro He His Phe Lys Phe Ala Arg Lys Leu Ser 
165 170 175 

35 Leu Ser Pro Phe Leu Asn Leu Ser His Leu Val Arg Asn Lys Pro Leu 
180 185 190 

Asn Thr Thr Asp Gly Gly Phe Met Leu Pro Leu Tyr His Glu Leu Ala 

195 200 205 

Thr Gin Tyr Pro Leu Leu Leu Lys Phe Asp Gin Gin Asn Asn Pro Arg 
40 210 215 220 

Glu Leu Leu Arg Pro Asn Thr Leu Asn His Gin Leu Gin Pro Ser Leu 
225 230 235 240 

Thr Pro Phe Lys Asp Cys Ala Val Met Ala Phe Arg Asn His Ser Phe 
245 250 255 

45 Lys Asp Ser Leu Met Leu Glu Thr Cys Lys Thr Pro Thr Asp Trp Gin 
260 265 270 

Lys Pro He Ser Thr Asn Leu Lys Asn Leu Asp Asp Ser Leu Asn Leu 

275 280 285 

Leu Asn Leu Asn Gly He Leu Tyr Leu He His Asn Pro Ser Asp Leu 
50 290 295 300 

Ser Leu Arg Arg Lys Glu Leu Trp Leu Ser Lys Leu Glu Asn Ser Asn 
305 310 315 320 

Ser Phe Lys Thr Leu Lys Val Leu Asp Lys Ala Asn Glu Val Ser Tyr 
325 330 335 

55 Pro Ser Tyr Ser Leu Asn Pro His Phe He Asp He Val Tyr Thr Tyr 
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Pro Ala lie Ala Gly Leu Phe Ser Lys Glu Arg Lys Glu Lys Pro Ser 

675 680 685 

Ser Lys Glu lie Gin Asp Glu Asp Val Phe lie Ser Ala Lys Gin Arg 
690 695 700 

5 Tyr Glu Lys Ala His Lys lie He Pro He Ser Thr Arg He His Ala 
705 710 715 720 

Lys Asp Val Val Leu He Tyr Lys Lys Met Pro Phe Pro Leu Glu Asn 

725 730 735 

Leu Asp He Val Ala Gin Asp Asp Arg Val Lys He Asp Gly Asn Tyr 
10 740 745 750 

Lys Asn Ala Met He Met Ala Asp Leu Val His Gly Ala Leu Tyr Leu 

755 760 765 

Lys Ala His Asn Phe Ser Gly Asp Tyr He Asn Thr He Leu Gin Lys 
770 775 780 

15 Asp Phe Val Glu G ly Gly Leu Phe Thr Leu He Gly Ala Leu Glu As p 
785 790 795 800 

Gin Val Phe Asn Gly Glu Leu Lys Phe Gin Asn Thr Ser Leu Lys Asn 

805 810 815 

Phe Ala Leu Met Gin Asn Met Val Asn Leu He Asn Thr He Pro Ser 
20 820 825 830 

Leu He Val Phe Arg Asn Pro His Leu Gly Ala Asn Gly Tyr Gin He 

835 840 845 

Lys Thr Gly Ser Val Val Phe Gly He Thr Lys Glu Tyr Leu Gly Leu 
850 855 860 

25 Glu Lys He Asp Leu Val Gly Lys Thr Leu Asp He Ala Gly Asn Gly 
865 870 875 880 

He He Glu Leu Asp Lys Asn Lys Leu Asp Leu Asn Leu Glu Val Ser 

885 890 895 

Thr He Lys Ala Leu Ser Asn Val Leu Asn Lys He Pro He Val Gly 
30 900 905 910 

Tyr Leu Val Leu Gly Lys Gly Gly Lys He Thr Thr Asn Val Asn Val 

915 920 925 

Lys Gly Thr Leu Asp Lys Pro Lys Thr Gin Val Thr Leu Ala Ser Asp 
930 935 940 

35 He He Gin Ala Pro Phe Lys He Leu Arg Arg He Phe Thr Pro He 
945 950 955 960 

Asp He He Val Asp Glu Val Lys Lys Asn He Asp Ser Lys Arg Lys 
965 970 975 

Leu Lys 

40 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 4 77 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : 1 inear 



50 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...477 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Met Asn Thr He He Arg Tyr Ala Ser Leu Trp Gly Leu Cys He Thr 
1 5 10 15 

Leu Thr Leu Ala Gin Thr Pro Ser Lys Thr Pro Asp Glu He Lys Gin 
10 20 25 30 

He Leu Asn Asn Tyr Ser His Lys Asn Leu Lys Leu He Asp Pro Pro 

35 40 45 

Thr Ser Ser Leu Glu Ala Thr Pro Gly Phe Leu Pro Ser Pro Lys Glu 
50 55 60 

15 Thr Ala Thr Thr He Asn Gin Glu He Ala Lys Tyr His Glu Lys Ser 
65 70 75 80 

Asp Lys Ala Ala Leu Gly Leu Tyr Glu Leu Leu Lys Gly Ala Thr Thr 

85 90 95 

Asn Leu Ser Leu Gin Ala Gin Glu Leu Ser Val Lys Gin Ala Met Lys 
20 100 105 110 

Asn His Thr He Ala Lys Ala Met Phe Leu Pro Thr Leu Asn Ala Ser 

115 120 125 

Tyr Asn Phe Lys Asn Glu Ala Arg Asp Thr Pro Glu Tyr Lys His Tyr 
130 135 140 

25 Asn Thr Gin Gin Leu Gin Ala Gin Val Thr Leu Asn Val Phe Asn Gly 
145 150 155 160 

Phe Ser Asn Val Asn Asn Val Lys Glu Lys Ser Ala Thr Tyr Arg Ser 

165 170 175 

Thr Val Ala Asn Leu Glu Tyr Ser Arg Gin Ser Val Tyr Leu Gin Val 
30 180 185 190 

Val Gin Gin Tyr Tyr Glu Tyr Phe Asn Asn Leu Ala Arg Met He Ala 

195 200 205 

Leu Gin Lys Lys Leu Glu Gin He Gin Thr Asp He Lys Arg Val Thr 
210 215 220 

35 Lys Leu Tyr Asp Lys Gly Leu Thr Thr He Asp Asp Leu Gin Ser Leu 
225 230 235 240 

Lys Ala Gin Gly Asn Leu Ser Glu Tyr Asp He Leu Asp Met Gin Phe 

245 250 255 

Ala Leu Glu Gin Asn Arg Leu Thr Leu Glu Tyr Leu Thr Asn Leu Ser 
40 260 265 270 

Val Lys Asn Leu Lys Lys Thr Thr He Asp Ala Pro Asn Leu Gin Leu 

275 280 285 

Arg Glu Arg Gin Asp Leu Val Ser Leu Arg Glu Gin He Ser Ala Leu 
290 295 300 

45 Arg Tyr Gin Asn Lys Gin Leu Asn Tyr Tyr Pro Lys He Asp Val Phe 
305 310 . 315 320 

Asp Ser Trp Leu Phe Trp He Gin Lys Pro Ala Tyr Ala Thr Gly Arg 

325 330 335 

Phe Gly Asn Phe Tyr Pro Gly Gin Gin Asn Thr Ala Gly Val Thr Ala 
50 340 345 350 

Thr Leu Asn He Phe Asp Asp He Gly Leu Ser Leu Gin Lys Gin Ser 

355 360 365 

He Met Leu Gly Gin Leu Ala Asn Glu Lys Asn Leu Ala Tyr Lys Lys 
370 375 380 

55 Leu Glu Gin Glu Lys Asp Glu Gin Leu Tyr Arg Lys Ser Leu Asp II 



e 
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340 345 350 

Asn Arg Ser His lie Lys His lie Arg Phe Asn Met Ala Tyr Leu Asn 

355 360 365 

Ser Leu Leu Lys 
5 370 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 978 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



20 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATION 1.,.978 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



Met Lys Lys 
1 

Leu Phe Val 

30 

Ser Asn Gly 
35 

Ser Glu Leu 
50 

35 Arg Val Asp 
65 

Leu Glu Val 
Val Ser Tyr 

40 

Lys Asn Lys 
115 

Phe Pro Gly 
130 

45 Lys Leu Lys 
145 

Asp Gly Asn 
Leu lie Val 

50 

Gly Leu Thr 
195 

Met Lys Ser 
210 

55 Lys Asn Leu 



Arg Lys His Val 
5 

Ala Val Phe Thr 
20 

lie His lie Gin 

Tyr Leu Lys Leu 
55 

Leu Ser Ser Phe 
70 

Ser Asp Leu lie 
85 

Phe Glu Lys Leu 
100 

Ala Asn lie Phe 

lie Lys Gly Glu 
135 

lie lie Asn Leu 
150 

Ala His Tyr Ser 
165 

Lys Pro Leu Val 
180 

Asp Leu Lys Thr 

Leu Ala Phe Leu 
215 

Lys Thr Trp lie 



Ser Lys Lys Val 
10 

Leu Leu Val Val 
25 

Asn Leu Lys lie 
40 

Asn Asn Lys Leu 

Phe His Gin Lys 
75 

Lys Asn lie Arg 
90 

Lys Val Lys Glu 
105 

Phe Asp Gly Asn 
120 

Phe Ser Leu Glu 

Leu Phe Lys Asp 
155 

Pro Lys Ala Arg 
170 

Glu Pro Ser Ala 
.185 

lie Glu Leu Lys 
200 

Lys Pro Leu Phe 
Phe Asp Lys lie 



Phe Asn Val lie lie 
15 

lie His Lys Thr Leu 
30 

Gly Lys Leu Gly lie 
45 

Ser Leu Glu Val Glu 
60 

Pro Thr Lys Lys Arg 
80 

Tyr Gly lie Trp Ala 
95 

lie lie Leu Asp Asp 
110 

Lys Tyr Glu Leu Glu 
125 

Asp Asp Lys Asn lie 
140 

Val Lys Val Gin Val 
160 

Lys Met Ala Phe Asn 
175 

Ala lie Tyr Leu Gin 
190 

lie Asn Thr Ser Pro 
205 

Gin Arg Gin Ser Gin 
220 

Gin Phe Ala Ser Phe 
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225 230 235 240 

Lys He Asp Asn Ala Leu He Lys Ala Asn Phe Thr Pro Ser Glu Phe 

245 250 255 

He Pro Ser Leu Leu Glu Asn Ser Val Val Lys Ala Thr Leu lie Lys 
5 260 265 270 

Pro Ser Val Val Phe Asn Asp Gly Leu Ser Pro He Lys Met Asp Lys 

275 280 285 

Thr Glu Leu He Phe Lys Asn Lys Gin Leu Leu He Gin Pro Gin Lys 
290 295 300 

10 He Thr Tyr Glu Thr Met Glu Leu Thr Gly Ser Tyr Ala Thr Phe Ser 
305 310 315 320 

Asn Leu Leu Glu Ala Pro Lys Leu Glu Val Phe Leu Lys Thr Thr Pro 

325 330 335 

Asn Tyr Tyr Gly Asp Ser He Lys Asp Leu Leu Ser Ala Tyr Lys Val 
15 340 345 350 

Val Leu Pro Leu Asp Lys He Ser Met Pro Ser Ser Ala Asp Leu Lys 

355 360 365 

Leu Thr Leu Gin Phe Leu Lys Asn Thr Ala Pro Leu Phe Ser Val Gin 
370 375 380 

':^0 Gly Ser Val Asn Leu Gin Glu Gly Thr Phe Ser Leu Tyr Asn He Pro 
385 390 395 400 

Leu Tyr Thr Gin Ser Ala Gin He Asn Leu Asp He Ala Gin Glu Tyr 

405 410 415 

Gin Tyr He Tyr He Asp Thr He His Thr Arg Tyr Ala Asn Met Leu 
25 420 425 430 

Asp Leu Asp Ala Lys He Ala Leu Asp Leu Gly Gin Lys Asn Leu Ser 

435 440 445 

Leu Asp Ser Leu Val His Lys He Gin Val Asn Thr Asn Asn Asn He 
450 455 460 

30 Asn Met Arg Ser Tyr Asp Pro Asn Asn Thr Gin Glu Asp Pro Gin Thr 
465 470 475 480 

Asn Phe Thr Leu Asp Leu Lys Ser Leu His Ser He He Gin Glu Gly 

485 490 495 

Glu Asn Ser Glu Val Phe Arg Arg Lys He He Asp Thr He Lys Ala 
35 500 505 510 

Gin Ser Glu Asp Lys Phe Thr Lys Asp Val Phe Tyr Ala Thr Gly Asp 

515 520 525 

Thr Leu Lys Ser Leu Ser Leu Ser Phe Asp Phe Ser Asn Pro Asp His 
530 535 540 

40 He Gin Trp Ser Val Pro Gin Leu Leu Leu Glu Gly Glu Phe Lys Asp 
545 550 555 560 

Asn Ala Tyr Thr Phe Lys He Lys Asp Leu Lys Lys He Lys Pro Tyr 

565 570 575 

Ser Pro He Met Asp Tyr He Ala Leu Lys Asp Gly Ser Leu Glu Val 
45 580 585 590 

Ser Thr Ser Asp Phe Val Asn He Asp Phe Phe Ala Lys Asp Leu Lys 

595 600 605 

He Asn Leu Pro He Tyr Arg Ser Asp Gly Ser His Phe Asp Ser Phe 
610 615 620 

50 Ser Leu Phe Gly Ser He Asn Lys Asp Glu He Ser Val Tyr Thr Pro 
625 630 635 640 

Ser Lys Ser He Ser He Lys Val Lys Gly Asp Gin Lys Asp He Thr 

645 650 655 

Leu Asn Asn He Asp Leu Ser He Asp Asp Phe Leu Asp Ser Lys Met 
55 660 665 670 
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385 390 
Ala Arg Ala Lys lie Glu Ser Ser 
405 

Leu Ser Phe Ala Asn lie Lys Arg 
5 420 

Phe Thr Thr Tyr Leu Arg Gly Leu 
435 440 
Ala Tyr Asn Leu Ala Leu Asn Asn 
450 455 
10 He Phe Asn Ser Gly His Lys He 
465 470 



395 400 
Lys Ala Ser Leu Asp Ala Ala Asn 

410 415 
Lys Tyr Asp Ala Asn Leu Val Asp 
425 430 
Thr Thr Arg Phe Asp Ala Glu Val 
445 

Tyr Glu Val Gin Lys Ala Asn Tyr 
460 

Asp Asp Tyr Val His 
475 



(2) INFORMATION FOR SEQ ID NO: 124: 



15 (i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 (ii> MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...412 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



Met Leu Ser 
1 

35 Leu Leu Thr 

Leu Glu lie 
35 

lie Gin Ala 
40 50 

Val Val Ser 
65 

Ala Asn Val 

45 Met Ser Leu 

Thr Gin Ser 
115 

Leu Lys Lys 

50 130 

lie Glu Asn 
145 

lie Lys Asn 
55 Pro Asp Leu 



Phe lie Ser Ala 
5 

Ala Leu Leu Leu 
20 

Gin Thr Phe Val 

Leu Gin Glu Gin 
55 

Lys Trp Asp Asn 
70 

Ser Asp Phe Phe 
85 

Gly Leu Ser Gin 
100 

Lys Met lie Asn 

Thr Lys Gin Gin 
135 

Tyr Lys Asn Gin 
150 

Leu Glu Asn Thr 
165 

lie Ala He Ala 



Phe Asp Lys Arg 
10 

Leu Phe Ser Leu 
25 

Ala Lys Tyr Leu 
40 

He Asp Ala Leu 

Pro He Leu Tyr 
75 

Arg Leu Asp Ser 
90 

Lys Val Asp Leu 
105 

Leu Glu Lys Gin 
120 

Leu Val He Asn 

Gin Glu He Glu 
155 

Leu Tyr Gin Ala 
170 

Lys Leu Glu He 



Gly Val Ser He Arg 
15 

Gly Leu Ala Lys Asp 
30 

Ser Lys Asn Gin Lys 
45 

Asp Ser Gin Glu Lys 
60 

Leu Gly Tyr Asn Asn 
80 

Thr Leu Met Gin Asn 
95 

Asn Gly Lys Lys Leu 
110 

Lys Lys He Leu Glu 
125 

Leu Met He Asn Gly 
140 

Leu Leu Asn Thr Ala 
160 

Asn His Ser Ser Ser 
175 

Leu Lys Ser Leu Leu 
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10 



15 



20 



25 



30 



50 









180 










185 










190 






Glu 


He 


Gin 


Lys 


Asn 


Asp 


Leu 


Glu 


Val 


Ala 


Leu 


Ser 


Ser 


Ser 


His 


Tyr 






195 










200 










205 








Ser 


Met 


Gly 


Glu 


Leu 


Thr 


Phe 


Lys 


Glu 


Asn 


Glu 


He 


Leu 


Ser 


He 


Ala 




210 










215 










220 










Pro 


Lfys 


Asn 


Phe 


Glu 


Phe 


Asn 


Asn 


Glu 


Gin 


Glu 


Leu 


His 


Asn 


He 


Ser 


225 








230 










235 










240 


Ala 


Thr 


Asn 


Tyr 


Asp 


He 


Ala 


He 


Ala 


Arg 


Leu 


Asp 


Glu 


Glu 


Lys 


Ala 










245 










250 










255 




Gin 


Lys 


Asp 


He 


Thr 


Leu 


Ala 


Lys 


Lys 


Ser 


Phe 


Leu 


Glu 


Asp 


He 


Asn 






260 










265 










270 






Val 


Thr 


Gly 


Val 


Tyr 


Tyr 


Phe 


Arg 


Ser 


Lys 


Gin 


Tyr 


Tyr 


Asn 


Tyr 


Asp 






275 










280 










285 








Met 


Phe 


Ser 


Val 


Ala 


Leu 


Ser 


He 


Pro 


Leu 


Pro 


Leu 


Tyr 


Gly 


Lys 


Gin 




290 










295 










300 










Ala 


Lys 


Leu 


Val 


Glu 


•Gin 


Lys 


Lys 


Lys 


Glu 


Ser 


Leu 


Ala 


Phe 


Lys 


Ser 


305 








310 










3X5 










320 


Glu 


Val 


Glu 


Asn 


Ala 


Lys 


Asn 


Lys 


Thr 


Arg 


His 


Leu 


Ala 


Leu 


Lys 


Leu 










325 










330 










335 




Leu 


Lys 


Lys 


Leu 


Glu 


Thr 


Leu 


Gin 


Lys 


Asn 


Leu 


Glu 


Ser 


He 


Asn 


Lys 






340 










345 










350 






He 


He 


Lys 


Gin 


Asn 


Glu 


Lys 


He 


Ala 


Gin 


He 


Tyr 


Ala 


Leu 


Asp 


Leu 






355 










360 










365 








Lys 


Thr 


ASH 


Gly 


Asp 


Tyr 


Asn 


Ala 


Tyr 


Tyr 


Asn 


Ala 


Leu 


Asn 


Asp 


Lys 


370 










375 










380 










He 


Thr 


He 


Gin 


He 


Thr 


Gin 


Leu 


Glu 


Thr 


Leu 


Ser 


Ala 


Leu 


Asn 


Ser 


385 










390 










395 










400 


Ala 


Tyr 


Leu 


Ser 


Leu 


Gin 


Asn 


Leu 


Lys 


Gly 


Leu 


Glu 











405 410 
(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 7 amino acids 
35 (B) TYPE: amino acid 

{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

40 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

45 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 
CB) LOCATION 1. . .137 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



Met Arg He Val Arg Asn Leu Phe Leu Val Ser Phe Val Ala Tyr Ser 
1 5 10 15 

Ser Ala Phe Ala Ala Asp Leu Glu Thr Gly Thr Lys Asn Asp Lys Lys 
20 25 30 

55 Ser Gly Lys Lys Phe Tyr Lys Leu His Lys Asn His Gly Ser Glu Thr 



wo 98/18323 



PCT/US97/19575 



- 203 - 

35 40 45 

Glu Thr Lys Asn Asp Lys Lys Leu Tyr Asp Phe Thr Lys Asn Ser Gly 

50 55 60 

Leu Glu Gly Val Asp Leu Glu Lys Ser Pro Asn Leu Lys Ser His Lys 
5 65 70 75 80 

Lys Ser Asp Lys Lys Phe Tyr Lys Gin Leu Ala Lys Asn Asn lie Ala 

85 90 95 

Glu Gly Val Ser Met Pro lie Val Asn Phe Asn Lys Ala Leu Ser Phe 
100 105 110 

10 Gly Pro Tyr Phe Glu Arg Thr Lys Ser Lys Lys Thr Gin Tyr Met Asp 
115 120 125 

Gly Gly Leu Met Met His lie Arg Phe 
130 135 

15 (2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 09 amino acids 

(B) TYPE: amino acid 
) (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
) (A) NAME/KEY: misc_feature 

(B) LOCATION 1,..309 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

i Leu Met Pro Gin Asn Gin Leu Val lie Thr lie lie Asp Glu Ser Gly 
15 10 15 

Ser Lys Gin Leu Lys Phe Ser Lys Asn Leu Lys Arg Asn Leu lie lie 

20 25 30 

Ser Val Val lie Leu Leu Leu lie Val Gly Leu Gly Val Gly Phe Leu 
I 35 40 45 

Lys Phe Leu lie Ala Lys Met Asp Thr Met Thr Ser Glu Arg Asn Ala 

50 55 60 

Val Leu Arg Asp Phe Arg Gly Leu Tyr Gin Lys Asn Tyr Ala Leu Ala 
65 70 75 80 

i Lys Glu lie Lys Asn Lys Arg Glu Glu Leu Phe lie Val Gly Gin Lys 

85 90 95 

lie Arg Gly Leu Glu Ser Leu lie Glu lie Lys Lys Gly Ala Asn Gly 

100 105 110 

Gly Gly His Leu Tyr Asp Glu Val .Asp Leu Glu Asn Leu Ser Leu Asn 
I 115 120 125 

Gin Lys His Leu Ala Leu Met Leu lie Pro Asn Gly Met Pro Leu Lys 

130 135 140 

Thr Tyr Ser Ala lie Lys Pro Thr Lys Glu Arg Asn His Pro lie Lys 
145 150 155 ISO 

Lys lie Lys Gly Val Glu Ser Gly lie Asp Phe lie Ala Pro Leu Asn 
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165 170 175 

Thr Pro Val Tyr Ala Ser Ala Asp Gly He Val Asp Phe Val Lys Thr 

180 185 190 

Arg Ser Asn Ala Gly Tyr Gly Asn Leu Val Arg He Glu His Ala Phe 
5 195 200 205 

Gly Phe Ser Ser He Tyr Thr His Leu Asp His Val Asn Val Gin Pro 

210 215 220 

Lys Ser Phe He Gin Lys Gly Gin Leu He Gly Tyr Ser Gly Lys Ser 
225 230 235 240 

10 Gly Asn Ser Gly Gly Glu Lys Leu His Tyr Glu Val Arg Phe Leu Gly 

245 250 255 

Lys He Leu Asp Ala Glu Lys Phe Leu Ala Trp Asp Leu Asp His Phe 

260 265 270 

Gin Ser Ala Leu Glu Glu Asn Lys Phe He Glu Trp Lys Asn Leu Phe 
15 275 280 285 

Trp Val Leu Glu Asp He Val Gin Leu Gin Glu His Val Asp Lys Asp 

290 295 300 

Thr Leu Lys Gly Gin 
305 



20 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 12 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 
25 (B) TYPE: amino acid 

{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

35 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...332 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 



Val 


Leu 


Tyr 


Phe 


Leu 


Thr 


Ser 


Leu 


Phe 


He 


Cys 


Ser 


Leu 


He 


Val 


Leu 


1 






5 










10 










15 




Trp 


Ser 


Lys 


Lys 


Ser 


Met 


Leu 


Phe 


Val 


Asp 


Asn 


Ala 


Asn 


Lys 


He 


Gin 






20 










25 










30 






Gly 


Phe 


His 


His 


Ala 


Arg 


Thr 


Pro 


Arg 


Ala 


Gly 


Gly 


Leu 


Gly 


He 


Phe 




35 










40 










45 








Leu 


Ser 
50 


Phe 


Ala 


Leu 


Ala 


Cys 
55 


Tyr 


Leu 


Glu 


Pro 


Phe 
60 


Glu 


Met 


Pro 


Phe 


Lys 


Gly 


Pro 


Phe 


Val 


Phe 


Leu 


Gly 


Leu 


Ser 


Leu 


Val 


Phe 


Leu 


Ser 


Gly 


65 










70 










75 










80 


Phe 


Leu 


Glu 


Asp 


He 


Asn 


Leu 


Ser 


Leu 


Ser 


Pro 


Lys 


He 


Arg 


Leu 


He 








85 










90 










95 




Leu 


Gin 


Ala 


Val 
100 


Gly 


Val 


Val 


Cys 


He 
105 


He 


Ser 


Ser 


Thr 


Pro 
110 


Leu 


Val 


Val 


Ser 


Asp 


Phe 


Ser 


Pro 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


Phe 


He 


Ala 


Phe 
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30 



35 







115 










120 










125 








Leu 


Phe 


Ala 


He 


Phe 


Met 


Leu 


Val 


Gly 


He 


Ser 


Asn 


Ala 


He 


Asn 


He 




130 










135 










140 










lie 


Asp 


Gly 


Phe 


Asn 


Gly 


Leu 


Ala 


Ser 


Gly 


He 


Cys 


Ala 


He 


Ala 


Leu 


145 










150 










155 










160 


Leu 


Val 


He 


His 


Tyr 


He 


Asp 


Pro 


Ser 


Ser 


Leu 


Ser 


Cys 


Leu 


Leu 


Ala 










165 










170 










175 




Tyr 


Met 


Val 


Leu 


Gly 


Phe 


Met 


Val 


Leu 


Asn 


Phe 


Pro 


Ser 


Gly 


Lys 


He 








180 










185 










190 






Phe 


Leu 


Gly 


Asp 


Gly 


Gly 


Ala 


Tyr 


Phe 


Leu Gly Leu Val 


Cys 


Gly 


He 






195 










200 










205 








Ser 


Leu 


Leu 


His 


Leu 


Ser 


Leu 


Glu 


Gin 


Lys 


He 


Ser 


Val 


Phe 


Phe 


Gly 




210 










215 










220 










Leu 


Asn 


Leu 


Met 


Leu 


Tyr 


Pro 


Val 


He 


Glu 


Val 


Leu 


Phe 


Ser 


He 


Leu 


225 










230 










235 










240 


Arg Arg 


Lys 


He 


Lys 


Arg 


Gin 


Lys 


^a 


Thr 


Met 


Pro 


Asp 


Asn 


Leu 


His 










245 










250 










255 




Leu 


His 


Thr 


Leu 


Leu 


Phe 


Lys 


Phe 


Leu 


Gin 


Gin 


Arg 


Ser 


Phe 


Asn 


Tyr 








260 










265 










270 






Pro 


Asn 


Pro 


Leu 


Cys 


Ala 


Phe 


He 


Leu 


He 


Leu 


Cys 


Asn 


Leu 


Pro 


Phe 






275 










280 










285 








He 


Leu 


He 


Ser 


Val 


Leu 


Phe 


Arg 


Leu 


Asp 


Ala 


Tyr 


Ala 


Leu 


He 


Val 




290 










295 










300 










He 


Ser 


Leu 


Val 


Phe 


He 


Ala 


Cys 


Tyr 


Leu 


He 


Gly Tyr 


Ala 


Tyr 


Leu 


305 










310 










315 










320 


Asn Arg 


Gin 


Val 


Cys 


Ala 


Leu 


Glu 


Lys 


Arg 


Ala 


Phe 











10 



15 



20 



25 

Asn Arg Gin Val 

325 330 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
45 (B) LOCATION 1. ..271 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Met Asn He Phe Lys Arg He He Cys Val Thr Ala He Val Leu Gly 

50 1 5 10 15 

Phe Phe Asn Leu Leu Asp Ala Lys His His Lys Glu Lys Lys Glu Asp 

20 25 30 

His Lys He Thr Arg Glu Leu Lys Val Gly Ala Asn Pro Val Pro His 
35 40 45 

55 Ala Gin He Leu Gin Ser Val Val Asp Asp Leu Lys Glu Lys Gly He 
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50 55 60 



10 



15 



20 



25 



30 



35 



Lys 


Leu 


Val 


He 


Val 


Ser 


Phe 


Thr 


Asp 


Tyr 


Val 


Leu 


Pro 


Asn 


Leu 




65 










70 










75 










Q A 


Leu 


Asn 


Asp 


Gly 


Ser 


Leu Asp 


Ala 


Asn 


Tyr 


Phe 


Gin 


His 


Arg 


Pro 


Tyr 








85 










90 










95 




Leu 


Asp 


Arg 


Phe 


Asn 


Leu 


Asp 


Arg 


Lys 


Met 


His 


Leu 


vax 


vaXy 


Leu 








100 










105 










110 






Asn 


He 


His 
115 


Val 


Glu 


Pro 


Leu 


Arg 
120 


Phe 


Tyr 


Ser 


Gin 


Lys 
125 


lie 


Thr 


Asp 


He 


Lys 


Asn 


Leu 


Lys 


Lys 


Gly 


Ser 


Val 


He 


Ala 


Val 


Pro 


Asn 


Asp 


Pro 




130 










135 










140 








lie 


Ala 


Asn 


Gin 


Gly 


Arg 


Ala 


Leu 


He 


Leu 


Leu 


His 


Lys 


Gin 


Gly 


Leu 


145 










150 










155 










160 


Ala 


Leu 


Lys 


Asp 


Pro 


Ser 


Asn 


Leu 


Tyr 


Ala 


Thr 


Glu 


Phe 


Asp 


He 


Val 








165 










170 










175 




Lys 


Asn 


Pro 


Tyr 


Asn 


He 


Lys 


He 


Lys 


Pro 


Leu 


Glu 


Ala 


Ala 


Leu 


Leu 






180 










185 










190 






Pro 


Lys 


Val 


Leu 


Gly 


Asp 


Val 


Asp 


Gly Ala 


He 


He 


Thr 


Gly 


Asn 


Tyr 




195 










200 










205 








Ala 


Leu 
210 


Gin 


Ala 


Lys 


Leu 


Thr 
215 


Gly 


Ala 


Leu 


Phe 


Ser 
220 


Glu 


Asp 


Lys 


Asp 


Ser 


Pro 


Tyr 


Ala 


Asn 


Leu 


Val 


Ala 


Ser 


Arg 


Glu 


Asp 


Asn 


Ala 


Gin 


Asp 


225 








230 










235 










240 


Glu 


Ala 


He 


Lys 


Ala 


Leu 


He 


Glu 


Ala 


Leu 


Gin 


Ser 


Glu 


Lys 


Thr 


Arg 








245 










250 










255 




Lys 


Phe 


He 


Leu 


Asp 


Thr 


Tyr 


Lys 


Gly Ala 


He 


He 


Pro 


Ala 


Phe 








260 










265 










270 







(2) INFORjWATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( i i i ) HYPOTHETI CAL : YES 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
45 (B) LOCATION 1., ..316 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Met Gin Glu Phe Ser Leu Trp Cys Asp Phe He Glu Arg Asp Phe Leu 
50 1 5 10 15 

Glu Asn Asp Phe Leu Lys Leu He Asn Lys Gly Ala He Cys Gly Ala 

20 25 30 

Thr Ser Asn Pro Ser Leu Phe Cys Glu Ala He Thr Lys Ser Ala Phe 
35 40 45 

55 Tyr Gin Asp Glu He Ala Lys Leu Lys Gly Lys Lys Ala Lys Glu He 
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50 






























Tyr 


Glu 


Thr 


Leu 


A, J. a 


Leu 


Lys 


Asp 




Leu 




/vxa 


Ser 


Ser 




Xieu 


65 










•7 n 










/ 3 










O \J 


Met 


Pro 


Leu 


Tyr 


LjXU 


Lys 


Asp 


Pro 


Asn 


Asn 


tixy 


Tyr 


XXc 


Ser 


Leu 


ox WL 










o o 
























lie 


Asp 


Pro 


Phe 


Leu 


oxU 


Asp 


Asp 


>vxa 


xxe 


Lys 


oer 


J. 


Asp 


Vji Li 


211 a 

M-La 








XU U 


























Lys 


Arg 


Leu 


Pne 


Lys 


Thr 


jueu 


Asn 


Arg 


Pro 


Asn 


vax 


wee 


xxe 


Lys 


vax 






115 










X2 U 










X^ 3 








Pro 


Ala 


Ser 


Glu 


Ser 


Ala 


Phe 


Glu 


vax 


xxe 


Ser 


AXa 


Leu 


Axa 


Gxn 


AX a 




130 










13 5 










X4(J 










Ser 


He 


Pro 


He 


Asn 


val 


Thr 


Leu 


Val 


Pne 


Ser 


Pro 


Lys 


xxe 


TV T fa 

AX a 


Gxy 


145 










150 










X3D 










XO u 


Glu 


He 


Ala 


Gin 


He 


Leu 


Ala 


Lys 


Glu 


AXa 


Arg 


Lys 


Arg 


TV 1 

AXa 


vax 


xxe 










165 










170 










X/b 




"Ser' 


" Var 


"Phe 


Val^ 


'Ser' Arg' 


^Phe~ 


~Asp~ 


~Lys~ 


"GluT 


lie 


Asp 


Pro 


Leu 


vax 




Pro 








180 










185 










190 






Gin 


Asn 


Leu 


Gin 


Ala 


Gin 


Ser 


Gly 


He 


Met 


Asn 


Ala 


Thr 


Glu 


Cys 


Tyr 






195 










200 










2 05 








Tyr 


Gin 


He 


Asn 


Gin 


His 


Ala 


Asn 


Lys 


Leu 


He 


Ser 


Thr 


Leu 


Phe 


AXa 




210 










215 










220 










Ser 


Thr 


Gly 


Val 


Lys 


Ser 


Asn 


Ser 


Leu 


TV T 

AXa 


Lys 


Asp 


Tyr 


Tyr 


xxe 


Lys 


225 










230 










•5 1 










•5 4, n 


Ala 


Leu 


Cys 


Phe 


Lys 


Asn 


Ser 


He 


Asn 


Thr 


Ala 


Pro 


Leu 


Asp 


Ala 


Leu 










245 










250 










255 




Asn 


Ala 


Tyr 


Leu 


Leu 


Asp 


Pro 


Asn 


Thr 


Glu 


Cys 


Gin 


Thr 


Pro 


Leu 


Lys 








260 










265 










270 






He 


Tlir 


Glu 


He 


Glu 


Ala 


Phe 


Lys 


Lys 


Glu 


Leu 


Lys 


Thr 


His 


Asn 


He 






275 










280 










285 








Asp 


Leu 


Glu 


Asn 


Thr 


Ala 


Gin 


Lys 


Leu 


Leu 


Lys 


Glu 


Gly 


Leu 


He 


Ala 




290 










295 










300 










Phe 


Lys 


Gin 


Ser 


Phe 


Glu 


Lys 


Leu 


Leu 


Ser 


Ser 


Phe 











305 310 315 



(2) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

( A ) NAME / KEY : mi s c_f e a tur e 

(B) LOCATION 1...260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
Met Lys Thr Asn Gly His Phe Lys Asp Phe Ala Trp Lys Lys Cys Phe 



wo 98/18323 PCT/US97/19575 

-208- 



45 



1 








5 










10 










15 




Leu 


Gly Ala 


Ser 


Val 


Val 


Ala 


Leu 


Leu 


Val 


Gly 


Cys 


Ser 


Pro 


His 


He 








20 










25 










30 






He 


Glu 


Thr 


Asn 


Glu 


Val 


Ala 


Leu 


Lys 


Leu 


Asn 


Tyr 


His 


Pro 


Ala 


Ser 






35 










40 










45 








Glu 


Lys 


Val 


Gin 


Ala 


Leu 


Asp 


Glu 


Lys 


He 


Leu 


Leu 


Leu 


Arg 


Pro 


Ala 




50 










55 










60 










Phe 


Gin 


Tyr 


Ser 


Asp 


Asn 


He 


Ala 


Lys 


Glu 


Tyr 


Glu 


Asn 


Lys 


Phe 


Lys 


65 










70 










75 










80 


Asn 


Gin 


Thr 


Thr 


Leu 


Lys 


Val 


Glu 


Glu 


He 


Leu 


Gin 


Asn 


Gin 


Gly 


Tyr 










85 










90 










95 




Lys 


Val 


He 


Asn 


Val 


Asp 


Ser 


Ser 


Asp 


Lys 


Asp 


Asp 


Phe 


Ser 


Phe 


Ala 






100 










105 










110 






Gin 


Lys 


Lys 


Glu 


Gly 


Tyr 


Leu 


Ala 


Val 


Ala 


Met 


Asn 


Gly Glu 


He 


Val 






115 










120 










125 








Leu 


Arg 


Pro 


Asp 


Pro 


Lys 


Arg 


Thr 


lie 


Gin 


Lys 


Lys 


Ser 


Glu 


Pro 


Gly 




130 










135 










140 










Leu 


Leu 


Phe 


Ser 


Thr 


Gly 


Leu 


Asp 


Lys 


Met 


Glu 


Arg 


Val 


Leu 


He 


Pro 


145 










150 










155 










160 


Ala 


Gly 


Phe 


Val 


Lys 


Val 


Thr 


He 


Leu 


Glu 


Pro 


Met 


Ser 


Gly 


Glu 


Ser 










165 










170 










175 




Leu 


Asp 


Ser 


Phe 


Thr 


Met 


Asp 


Leu 


Ser 


Glu 


Leu 


Asp 


He 


Gin 


Glu 


Lys 








180 










185 










190 






Phe 


Leu 


Lys 


Thr 


Thr 


His 


Ser 


Ser 


His 


Ser 


Gly 


Gly 


Leu 


Val 


Ser 


Thr 






195 










200 










205 








Met 


Val 


Lys 


Gly 


Thr 


Asp 


Asn 


Ser 


Asn 


Asp 


Ala 


He 


Lys 


Ser 


Ala 


Leu 




210 










215 










220 










Asn 


Lys 


lie 


Phe 


Ala 


Ser 


He 


Met 


Gin 


Glu 


Met 


Asp 


Lys 


Lys 


Leu 


Thr 


225 








230 










235 










240 


Gin 


Arg 


Asn 


Leu 


Glu 


Ser 


Tyr 


Gin 


Lys 


Asp 


Ala 


Lys 


Glu 


Leu 


Lys 


Asn 










245 










250 










255 




Lys 


Arg 


Asn 


Arg 



























10 



15 



20 



25 



30 

Lys Arg Asn 

260 

35 (2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 13 82 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
50 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...1382 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

55 Leu Asn Phe Asn Asn Leu Thr Ala Asn Gly Ala Leu Asn Phe Asn Gly 



BNSOOCiD- <WO 9818323A1 1 > 



wo 98/18323 



PCT/US97/19575 



- 209 - 

15 10 15 

Tyr Ala Pro Ser Leu Thr Lys Ala Leu Met Asn Val Ser Gly Gin Phe 

20 25 30 

Val Leu Gly Asn Asn Gly Asp lie Asn Leu Ser Asp lie Asn lie Phe 
5 35 40 45 

Asp Asn lie Thr Lys Ser Val Thr Tyr Asn lie Leu Asn Ala Gin Lys 

50 55 60 

Gly He Thr Gly He Ser Gly Ala Asn Gly Tyr Glu Lys He Leu Phe 
65 70 75 80 

10 Tyr Gly Met Lys lie Gin Asn Ala Thr Tyr Ser Asp Asn Asn Asn He 

85 90 95 

Gin Thr Trp Ser Phe He Asn Pro Leu Asn Ser Ser Gin He He Gin 

100 105 110 

Glu Ser He Lys Asn Gly Asp Leu Thr He Glu Val Leu Asn Asn Pro 
15 115 120 125 



Asn Ser Ala Ser Asn Thr He Phe Asn He Ala Pro Glu Leu Tyr Asn 

130 135 140 

Tyr Gin Asp Ser Lys Gin Asn Pro Thr Gly Tyr Ser Tyr Asp Tyr Ser 
145 150 155 160 

20 Asp Asn Gin Ala Gly Thr Tyr Tyr Leu Thr Ser Asn He Lys Gly Leu 

165 170 175 

Phe Thr Pro Lys Gly Ser Gin Thr Pro Gin Thr Pro Gly Thr Tyr Ser 

180 185 190 

Pro Phe Asn Gin Pro Leu Asn Ser Leu Asn He Tyr Asn Lys Gly Phe 
25 195 200 205 

Ser Ser Glu Asn Leu Lys Thr Leu Leu Gly He Leu Ser Gin Asn Ser 

210 215 220 

Ala Thr Leu Lys Glu Met He Glu Ser Asn Gin Leu Asp Asn He Thr 
225 230 235 240 

30 Asn He Asn Glu Val Leu Gin Leu Leu Asp Lys He Lys He Thr Gin 

245 250 255 

Ala Gin Lys Gin Ala Leu Leu Glu Thr He Asn His Leu Thr Asp Asn 

260 265 270 

He Asn Gin Thr Phe Asn Asn Gly Asn Leu Val He Gly Ala Thr Gin 
35 275 280 285 

Asp Asn Val Thr Asn Ser Thr Ser Ser He Trp Phe Gly Gly Asn Gly 

290 295 300 

Tyr Ser Ser Pro Cys Ala Leu Asp Ser Ala Thr Cys Ser Ser Phe Arg 
305 310 315 320 

40 Asn Thr Tyr Leu Gly Gin Leu Leu Gly Ser Thr Ser Pro Tyr Leu Gly 

325 330 335 

Tyr He Asn Ala Asp Phe Lys Ala Lys Ser He Tyr He Thr Gly Thr 

340 345 350 

He Gly Ser Ser Asn Ala Phe Glu Ser Gly Gly Ser Ala Asp Val Thr 
45 355 360 365 

Phe Gin Ser Ala Asn Asn Leu Val Leu Asn Lys Ala Asn He Glu Ala 

370 375 380 

Gin Ala Thr Asp Asn He Phe Asn Leu Leu Gly Gin Glu Gly He Asp 
385 390 395 400 

50 Lys He Phe Asn Gin Gly Asn Leu Ala Asn Val Leu Ser Gin Met Ala 

405 410 415 

Met Glu Lys He Lys Gin Ala Gly Gly Leu Gly Asn Phe He Glu Asn 

420 425 430 

Ala Leu Ser Pro Leu Ser Lys Glu Leu Pro Ala Ser Leu Gin Asp Glu 
55 435 440 445 



wo 98/18323 



PCT/US97/19575 



-210- 

Thr Leu Gly Gin Leu lie Gly Gin Asn Asn Leu Asp Asp Leu Leu Asn 

450 455 460 

Asn Ser Gly Val Met Asn Glu lie Gin Asn He He Ser Gin Lys Leu 
465 470 475 480 

5 Ser He Phe Gly Asn Phe Val Thr Pro Ser He He Glu Asn Tyr Leu 

485 490 495 

Ala Lys Gin Ser Leu Lys Ser Met Leu Asp Asp Lys Gly Leu Leu Asn 

500 505 510 

Phe He Gly Gly Tyr He Asp Ala Ser Glu Leu Ser Ser He Leu Gly 
10 515 520 525 

Val He Leu Lys Asp He Thr Asn Pro Pro Thr Ser Leu Gin Lys Asp 

530 535 540 

He Gly Val Val Ala Asn Asp Leu Leu Asn Glu Phe Leu Gly Gin Asp 
545 550 555 560 

15 Val Val Lys Lys Leu Glu Ser Gin Gly Leu Val Ser Asn He He Asn 

565 570 575 

Asn Val He Ser Gin Gly Gly Leu Ser Gly Val Tyr Asn Gin Gly Leu 

580 585 590 

Gly Ser Val Leu Pro Pro Ser Leu Gin Asn Ala Leu Lys Glu Asn Asp 
20 595 600 605 

Leu Gly Thr Leu Leu Ser Pro Arg Gly Leu His Asp Phe Trp Gin Lys 

610 615 620 

Gly Tyr Phe Asn Phe Leu Ser Asn Gly Tyr Val Phe Val Asn Asn Ser 
625 630 635 640 

25 Ser Phe Ser Asn Ala Thr Gly Gly Ser Leu Asn Phe Val Ala Asn Lys 

645 650 655 

Ser/ He He Phe Asn Gly Asp Asn Thr He Asp Phe Ser Lys Tyr Gin 

660 665 670 

Gly Ala Leu He Phe Ala Ser Asn Gly Val Ser Asn He Asn He Thr 
30 675 680 685 

Thr Leu Asn Ala Thr Asn Gly Leu Ser Leu Asn Ala Gly Leu Asn Asn 

690 695 700 

Val Ser Val Gin Lys Gly Glu He Cys He. Asn Leu Ala Asn Cys Pro 
705 710 715 720 

35 Thr Thr Lys Asn Ser Ser Pro Ala Asn Ser Ser Val Thr Pro Thr Asn 

725 730 735 

Glu Ser Leu Ser Val His Ala Asn Asn Phe Thr Phe Leu Gly Thr He 

740 745 750 

He Ser Asn Gly Ala He Asp Leu Ser Gin Val Thr Asn Asn Ser Val 
40 755 760 765 

He Gly Thr Leu Asn Leu Asn Glu Asn Ala Thr Leu Gin Ala Asn Asn 

770 775 780 

Leu Thr He Thr Asn Ala Phe Asn Asn Ala Ser Asn Ser Thr Ala Asn 
785 790 795 800 

45 He Asp Gly Asn Phe Thr Leu Asn Gin Gin Ala Thr Leu Ser Thr Asn 

805 810 815 

Ala Ser Gly Leu Asn Val Met Gly Asn Phe Asn Ser Tyr Gly Asp Leu 

820 825 830 

Val Phe Asn Leu Ser His Ser Val Ser His Ala He He Asn Thr Gin 
50 835 840 845 

Gly Thr Ala Thr He Met Ala Asn Asn Asn Pro Leu He Gin Phe Asn 

850 855 860 

Ala Ser Ser Lys Glu Val Gly Thr Tyr Thr Leu He Asp Ser Ala Lys 
865 870 875 880 

55 Ala He Tyr Tyr Gly Tyr Asn Asn Gin He Thr Gly Gly Ser Ser Leu 
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885 890 895 

Asp Asn Tyr Leu Lys Leu Tyr Ala Leu lie Asp lie Asn Gly Lys His 

900 905 910 

Met Val Met Thr Asp Asn Gly Leu Thr Tyr Asn Gly Gin Ala Val Ser 
5 915 920 925 

Val Lys Asp Gly Gly Leu Val Val Gly Phe Lys Asp Ser Gin Asn Gin 

930 935 940 

Tyr lie Tyr Thr Ser lie Leu Tyr Asn Lys Val Lys lie Ala Val Ser 
945 950 955 960 

10 Asn Asp Pro lie Asn Asn Pro Gin Ala Pro Thr Leu Lys Gin Tyr lie 

965 970 975 

Ala Gin lie Gin Gly Val Gin Ser Val Asp Ser lie Asp Gin Ala Gly 

980 985 990 

Gly Asn Gin Ala lie Asn Trp Leu Asn Lys lie Phe Glu Thr Lys Gly 
15 995 1000 1005 

-Ser ~Fro' -Leu"*Phe Alra Pro T-yr Tyr Leu "Glu "Ser Hxs^ 'Ser^ Thr Lys Asp 

1010 1015 1020 

Leu Thr Thr He Ala Gly Asp He Ala Asn Thr Leu Glu Val He Ala 
102 5 103 0 1035 1040 

20 Asn Pro Asn Phe Lys Asn Asp Ala Thr Asn He Leu Gin He Asn Thr 

1045 1050 1055 

Tyr Thr Gin Gin Met Ser Arg Leu Ala Lys Leu Ser Asp Thr Ser Thr 

1060 1065 1070 

Phe Ala Arg Ser Asp Phe Leu Glu Arg Leu Glu Ala Leu Lys Asn Lys 
25 1075 1080 1085 

Arg Phe Ala Asp Ala He Pro Asn Ala Met Asp Val He Leu Lys Tyr 

1090 1095 1100 

Ser Gin Arg Asn Arg Val Lys Asn Asn Val Trp Ala Thr Gly Val Gly 
1105 1110 1115 1120 

30 Gly Ala Ser Phe He Ser Gly Gly Thr Gly Thr Leu Tyr Gly He Asn 

1125 1130 1135 

Val Gly Tyr Asp Arg Phe He Lys Gly Val He Val Gly Gly Tyr Ala 

1140 1145 1150 

Ala Tyr Gly Tyr Ser Gly Phe His Ala Asn He Thr Gin Ser Gly Ser 
35 1155 1160 1165 

Ser Asn Val Asn Val Gly Val Tyr Ser Arg Ala Phe He Lys Arg Ser 

1170 1175 1180 

Glu Leu Thr Met Ser Leu Asn Glu Thr Trp Gly Tyr Asn Lys Thr Phe 
1185 1190 1195 1200 

40 He Asn Ser Tyr Asp Pro Leu Leu Ser He He Asn Gin Ser Tyr Arg 

1205 1210 1215 

Tyr Asp Thr Trp Thr Thr Asp Ala Lys He Asn Tyr Gly Tyr Asp Phe 

1220 1225 1230 

Met Phe Lys Asp Lys Ser Val He Phe Lys Pro Gin Val Gly Leu Ser 
45 1235 1240 1245 

Tyr Tyr Tyr He Gly Leu Ser Gly Leu Arg Gly He Met Asp Asp Pro 

1250 1255 1260 

He Tyr Asn Gin Phe Arg Ala Asn Ala Asp Pro Asn Lys Lys Ser Val 
1265 1270 1275 1280 

50 Leu Thr He Asn Phe Ala Leu Glu Ser Arg His Tyr Phe Asn Lys Asn 

1285 1290 1295 

Ser Tyr Tyr Phe Val He Ala Asp Val Gly Arg Asp Leu Phe He Asn 

1300 1305 1310 

Ser Met Gly Asp Lys Met Val Arg Phe He Gly Asn Asn Thr Leu Ser 
55 1315 1320 1325 
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Tyr Arg Asp Gly Gly Arg Tyr Asn 

1330 1335 
Gly Glu He Arg Leu Phe Lys Thr 
1345 1350 
5 Ala Arg Phe Gly Leu Asp Tyr Lys 

1365 

Gly Met Arg Tyr Ala Phe 
1380 



-212- 

Thr Phe Ala Ser lie He Thr Gly 
1340 

Phe Tyr Val Asn Ala Gly He Gly 

1355 1360 
Asp He Asn He Thr Gly Asn He 
1370 1375 



10 (2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
25 (A) NAME/KEY : misc__f eature 

(B) LOCATION 1...262 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



30 


Met 


Lys 


Lys 


He 


Gly 


Leu 


Ser 


Leu 


Cys 


Leu 


Val 


Leu 


Ser 


Leu 


Gly 


Phe 




1 






5 










10 










15 






Leu 


Lys 


Ala 


His 


Glu 


Val 


Ser 


Ala 


Glu 


Glu 


He 


Ala 


Asp 


He 


Phe 


Tyr 








20 










25 










30 








Lys 


Leu 


Asn 


Ala 


Lys 


Glu 


Pro 


Lys 


Met 


Lys 


He 


Asn 


His 


Thr 


Lys 


Gly 


35 




35 










40 










45 










Phe 


Cys 
50 


Ala 


Lys 


Gly 


Val 


Phe 
55 


Leu 


Pro 


Asn 


Pro 


Gin 
60 


Ala 


Arg 


Glu 


Asp 




Leu 


Glu 


Val 


Pro 


Leu 


Leu 


Asn 


Glu 


Lys 


Glu 


He 


Pro 


Ala 


Ser 


Val 


Arg 




65 










70 










75 










80 


40 


Tyr 


Ser 


Leu 


Gly 


Gly 


Val 


Ala 


Met 


Asp 


Asp 


Lys 


Ser 


Lys 


Val 


Arg 


Gly 










85 










90 










95 






Met 


Ala 


Leu 


Lys 
100 


Leu 


Glu 


Asn 


Gin 


Asn 
105 


Ala 


Ser 


Trp 


Thr 


Met 
110 


Val 


Met 




Leu 


Asn 


Thr 


Glu 


He 


Asn 


Phe 


Ala 


Lys 


Asn 


Pro 


Glu 


Glu 


Phe 


Ala 


Gin 


45 






115 










120 










125 










Phe 


Phe 


Glu 


Met 


Arg 


Leu 


Pro 


Lys 


Asn 


Gly 


Lys 


Val 


Asp 


Glu 


Ala 


Arg 






130 










135 










140 








Ala 




He 


Lys 


Lys 


Leu 


Tyr 


Glu 


Glu 


Val 


Pro 


Ser 


Tyr 


Arg 


Asn 


Phe 


Ala 




145 








150 










155 










160 


50 


Tyr 


Met 


Lys 


Thr 


He 


Gly 


He 


Ser 


Ser 


Ser 


Val 


Ala 


Asn 


Thr 


Pro 


Tyr 










165 










170 










175 






Tyr 


Ser 


Val 


His 


Ala 


Phe 


Lys 


Phe 


Lys 


Asp 


Lys 


Lys 


Glu 


Lys 


Leu 


Leu 








180 










185 










190 








Pro 


Ala 


Arg 


Trp 


Lys 


Phe 


Val 


Pro 


Lys 


Glu 


Gly Val 


Lys 


Tyr 


Leu 


Asn 


55 






195 










200 










205 
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Pro Gin Glu Leu Lys Gin Lys Asp Ser Asn Tyr Leu Leu Ser Ser Phe 

210 215 220 

Gin Gin His Leu Lys Asn Lys Pro lie Glu Tyr Gin Met Tyr Leu Val 

225 230 235 240 

5 Phe Ala Asn Gin Asn Asp Ala Thr Asn^ Asp Thr Thr Ala Leu Trp Lys 

245 250 255 

Gly Ser lie Arg Asn Tyr 
260 

10 (2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( i i i ) HYPOTHET I C AL : YES 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
25 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...246 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 



30 


Met 
1 


Lys 


Gin 


Phe 


Lys 
5 


Lys 


Lys 


Pro 


Lys 


Lys 
10 


He 


Lys 


Arg 


Ser 


His 
15 


Gin 




Asn 


Gin 


Lys 


Thr 
20 


He 


Leu 


Lys 


Arg 


Pro 
25 


Leu 


Trp 


Leu 


Met 


Pro 
30 


Leu 


Leu 




He 


Gly 


Gly 


Phe 


Ala 


Ser 


Gly 


Val 


Tyr 


Ala 


Asp 


Gly 


Thr 


Asp 


He 


Leu 


35 






35 










40 










45 










Gly 


Leu 
50 


Ser 


Trp 


Gly 


Glu 


Lys 
55 


Ser 


Gin 


Lys 


Val 


Cys 
60 


Val 


His 


Arg 


Pro 




Trp 


Tyr 


Ala 


He 


Trp 


Ser 


Cys 


Asp 


Lys 


Trp 


Glu 


Glu 


Lys 


Thr 


Gin 


Gin 




65 










70 










75 










80 


40 


Phe 


Thr 


Gly 


Asn 


Gin 
85 


Leu 


He 


Thr 


Lys 


Thr 
90 


Trp 


Ala 


Gly 


Gly 


Asn 
95 


Ala 




Ala 


Asn 


Tyr 


Tyr 
100 


His 


Ser 


Gin 


Asn 


Asn 
105 


Gin 


Asp 


He 


Thr 


Ala 
110 


Asn 


Leu 




Lys 


Asn 


Asp 


Asn 


Gly 


Thr 


Tyr 


Phe 


Leu 


Ser 


Gly 


Leu 


Tyr 


Asn 


Tyr 


Thr 


45 






115 










120 










125 










Gly Gly Glu 


Tyr 


Asn 


Gly 


Gly 


Asn 


Leu 


Asp 


He 


Glu 


Leu 


Gly 


Ser 


Asn 






130 










135 










140 












Ala 


Thr 


Phe 


Asn 


Leu 


Gly 


Ala 


Ser 


Ser 


Gly 


Asn 


Ser 


Phe 


Thr 


Ser 


Trp 




14 5 










150 










155 










160 


50 


Tyr 


Pro 


Asn 


Gly 


His 
165 


Thr 


Asp 


Val 


Thr 


Phe 
170 


Ser 


Ala 


Gly 


Thr 


He 
175 


Asn 




Val 


Asn 


Asn 


Ser 
180 


Val 


Glu 


Val 


Gly 


Asn 
185 


Arg 


Val 


Gly 


Ser 


Gly 
190 


Ala 


Gly 




Thr 


His 


Thr 


Gly 


Thr 


Ala 


Thr 


Leu 


Asn 


Leu 


Asn 


Ala 


Asn 


Lys 


Val 


Thr 


55 






195 










200 










205 
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Ile Asn Ser Asn He Ser Ala Tyr Lys Thr Ser Gin Val Asn Val Gly 

210 215 220 

Asn Ala Asn Ser Val He Thr He Asn Ser Val Ser Leu Asn Gly Glu 
225 230 235 240 

5 Tyr Leu Gin Phe Phe Ser 

245 

(2) INFORMATION FOR SEQ ID NO: 134: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

15 (ii) MOLECXHjE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...245 



25 



30 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134: 



Met 


He 


Lys 


Lys 


Thr 


Leu 


Ala 


Ser 


Val 


Leu 


Leu 


Gly 


Leu 


Ser 


Leu 


Met 


1 




5 










10 










15 




Ser 


Val 


Leu 


Asn 
20 


Ala 


Lys 


Glu 


Cys 


Val 
25 


Ser 


Pro 


He 


Thr 


Arg 
30 


Ser 


Val 


Lys 


Tyr 


His 


Gin 


Gin 


Ser 


Ala 


Glu 


He 


Arg 


Ala 


Leu 


Gin 


Leu 


Gin 


Ser 


35 










40 










45 




Val 




Tyr 


Lys 


Met 


Ala 


Lys 


Met 


Ala 


Leu 


Asp 


Asn 


Asn 


Leu 


Lys 


Leu 


Lys 


50 










55 










60 










Asp 


Lys 


Lys 


Pro 


Ala 


Val 


He 


Leu 


Asp 


Leu 


Asp 


Glu 


Thr 


Val 


Leu 


Asn 


65 








70 










75 










80 


Thr 


Phe 


Asp 


Tyr 


Ala 


Gly 


Tyr 


Leu 


Val 


Lys 


Asn 


Cys 


He 


Lys 


Tyr 


Thr 






85 










90 










95 




Pro 


Glu 


Thr 


Trp 
100 


Asp 


Lys 


Phe 


Glu 


Lys 
105 


Glu 


Gly 


Ser 


Leu 


Thr 
110 


Leu 


He 


Pro 


Gly 


Ala 


Leu 


Asp 


Phe 


Leu 


Glu 


Tyr 


Ala 


Asn 


Ser 


Lys 


Gly 


Val 


Lys 




115 










120 










125 








He 


Phe 


Tyr 


He 


Ser 


Asn 


Arg 


Thr 


Gin 


Lys 


Asn 


Lys 


Ala 


Phe 


Thr 


Leu 




130 








135 










14 0 








Val 


Lys 


Thr 


Leu 


Lys 


Ser 


Phe 


Lys 


Leu 


Pro 


Gin 


Val 


Ser 


Glu 


Glu 


Ser 


145 










150 










155 










160 


Leu 


Leu 


Lys 


Glu 


Lys 


Gly 


Lys 


Pro 


Lys 


Ala 


Val 


Arg 


Arg 


Glu 


Leu 


Val 








165 










170 










175 




Ala 


Lys 


Asp 


Tyr 


Ala 


He 


Val 


Leu 


Gin 


Val 


Gly Asp 


Thr 


Leu 


His 


Asp 




180 










185 










190 






Phe 


Asp 


Ala 


He 


Phe 


Ala 


Lys 


Asp 


Ala 


Lys 


Asn 


Ser 


Gin 


Glu 


Gin 


Gin 




195 










200 










205 






He 


Ala 


Lys 
210 


Val 


Leu 


Gin 


Asn 


Ala 
215 


Gin 


Lys 


Phe 


Gly 


Thr 
220 


Glu 


Trp 


He 
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Leu Pro Asn Ser Leu Tyr Gly Thr Trp Glu Asp Gly Pro lie Lys Ala 
225 230 235 240 

Trp Gin Asn Lys Lys 
245 

5 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids 
10 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECXKiE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

20 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...288 



25 



30 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 



Leu 


Trp 


Cys 


Leu 


Lys 


Thr 


Pro 


He 


He 


Gly 


His 


Gly 


Met 


Lys 


Lys 


Lys 


1 








5 










10 










15 




Ala 


Lys 


Val 


Phe 


Trp 


Cys 


Cys 


Phe 


Lys 


Met 


He 


Arg 


Trp 


Leu 


Tyr 


Leu 








20 










25 










30 






Ala 


Val 


Phe 


Phe 


Leu 


Leu 


Ser 


Val 


Ser 


Asp 


Ala 


Lys 


Glu 


He 


Ala 


Met 






35 










40 










45 








Gin 


Arg 


Phe 


Asp 


Lys 


Gin 


Asn 


His 


Lys 


He 


Phe 


Glu 


He 


Leu 


Ala 


Asp 




50 










55 










60 










Lys 


Val 


Ser 


Ala 


Lys 


Asp 


Asn 


Val 


He 


Thr 


Ala 


Ser 


Gly 


Asn 


Ala 


He 


65 










70 










75 










80 


Leu 


Leu 


Asn 


Tyr 


Asp 


Val 


Tyr 


He 


Leu 


Ala 


Asp 


Lys 


Val 


Arg 


Tyr 


Asp 










85 










90 










95 




Thr 


Lys 


Thr 


Lys 


Glu 


Ala 


Leu 


Leu 


Glu 


Gly 


Asn 


He 


Lys 


Val 


Tyr 


Arg 








100 










105 










110 






Gly 


Glu 


Gly 


Leu 


Leu 


Val 


Lys 


Thr 


Asp 


Tyr 


Val 


Lys 


Leu 


Ser 


Leu 


Asn 






115 










120 










12 5 








Glu 


Lys 


Tyr 


Glu 


He 


He 


Phe 


Pro 


Phe 


Tyr 


Val 


Gin 


Asp 


Ser 


Val 


Ser 




130 










135 










140 










Gly 


lie 


Trp 


Val 


Ser 


Ala 


Asp 


He 


Ala 


Ser 


Gly 


Lys 


Asp 


Gin 


Lys 


Tyr 


145 










150 










155 










160 


Lys 


He 


Lys 


Asn 


Met 


Ser 


Ala 


Ser 


Gly 


Cys 


Ser 


He 


Asp 


Asn 


Pro 


He 










165 










170 










175 




Trp 


His 


Val 


Asn 


Ala 


Thr 


Ser 


Gly 


Ser 


Phe 


Asn 


Met 


Gin 


Lys 


Ser 


His 








180 










185 










190 






Leu 


Ser 


Met 


Trp 


Asn 


Pro 


Lys 


He 


Tyr 


Val 


Gly 


Asp 


He 


Pro 


Val 


Leu 






195 










200 










205 








Tyr 


Leu 


Pro 


Tyr 


He 


Phe 


Met 


Ser 


Thr 


Ser 


Asn 


Lys 


Arg 


Thr 


Thr 


Gly 




210 










215 










220 










Phe 


Leu 


Tyr 


Pro 


Glu 


Phe 


Gly 


Thr 


Ser 


Asn 


Leu 


Asp 


Gly 


Phe 


He 


Tyr 


225 










230 










235 










240 
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Leu Gin Pro Phe Tyr Leu Ala Pro Lys Asn Ser Trp Asp Met Thr Phe 

245 250 255 

Thr Pro Gin lie Arg Tyr Lys Arg Gly Phe Gly Leu Asn Phe Glu Ala 
260 265 270 

5 Arg Tyr lie Asn Ser Lys Thr Gin Val Phe lie Gin Cys Ala Leu Phe 
275 280 285 

(2) INFORr4ATI0N FOR SEQ ID NO: 136: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

15 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Helicobacter pylori 

{ ix ) FEATURE : 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1...128 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 





Leu 


Met 


Phe 


Lys 


Lys 


Met 


Cys 


Leu 


Ser 


Leu 


Leu 


Met 


He 


Ser 


Gly Val 




1 








5 










10 










15 




30 


Cys 


Val 


Gly 


Ala 


Lys 


Asp 


Leu 


Asp 


Phe 


Lys 


Leu Asp 


Tyr 


Arg 


Ala 


Thr 










20 










25 










30 








Gly Gly Lys 


Phe 


Met 


Gly 


Lys 


Met 


Thr 


Asp 


Ser 


Ser 


Leu 


Leu 


Ser 


He 








35 










40 










45 










Thr 


Ser 


Met 


Asn 


Asp 


Glu 


Pro 


Val 


Val 


He 


Lys 


Asn 


Leu 


He 


Val 


Asn 


35 




50 










55 










60 












Arg 


Gly 


Asn 


Ser 


Cys 


Glu 


Ala 


Thr 


Lys 


Lys 


Val 


Glu 


Pro 


Lys 


Phe 


Gly 




65 










70 










75 










80 




Asp 


Lys 


Phe 


Lys 


Lys 
85 


Glu 


Lys 


Leu 


Phe 


Asp 
90 


His 


Glu 


Leu 


Lys 


Tyr 
95 


Ser 


40 


Gin 


Gin 


lie 


Phe 
100 


Tyr 


Arg 


Leu 


Asp 


Cys 
105 


Lys 


Pro 


Asn 


Gin 


Leu 
110 


Leu 


Glu 




Val 


Lys 


lie 
115 


lie 


Thr 


Asp 


Lys 


Gly 
120 


Glu 


Tyr 


Tyr 


His 


Lys 
125 


Phe 


Ser 


Lys 



45 (2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 
50 ( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) H YPOTHET I CAL : YE S 

55 
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50 



-217- 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...169 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



10 


Met 
1 


Gin 


Ala 


Leu 


Lys 
5 


Ser 


Leu 


Leu 


Glu 


Val 
10 


He 


Thr 


Lys 


Leu 


Gin 
15 


Asn 




Leu 


Gly 


Gly 


Tyr 
20 


Leu 


Met 


His 


He 


Ala 
25 


He 


Phe 


He 


He 


Phe 
30 


He 


Trp 




He 


Gly Gly 


Leu 


Lys 


Phe 


Val 


Pro 


Tyr 


Glu 


Ala 


Glu 


Gly 


He 


Ala 


Pro 


15 






35 










40 










45 










Phe~ 


Val- 
50 


-Ala- 


""A'sn" 


~Ser~ 


"Pro" 


-phe- 
55 


-pKe~ 


"Se^r^ 


"Phe" 


"Met 


"Tyr 
60 


'Lys 


Phe 


Glu 


Lys 




Pro 


Ala 


Tyr 


Lys 


Gin 


His 


Lys 


Met 


Ser 


Glu 


Ser 


Gin 


Ser 


Met 


Gin 


Glu 




65 










70 










75 










80 


20 


Glu 


Met 


Gin 


Asp 


Asn 
85 


Pro 


Lys 


He 


Val 


Glu 
90 


Asn 


Lys 


Glu 


Trp 


His 
95 


Lys 




Glu 


Asn 


Arg 


Thr 
100 


Tyr 


Leu 


Val 


Ala 


Glu 
105 


Gly 


Leu 


Gly 


He 


Thr 
110 


He 


Met 




He 


Leu 


Gly 


He 


Leu 


Val 


Leu 


Leu 


Gly 


Leu 


Trp 


Met 


Pro 


Leu 


Met 


Gly 


25 






115 










120 










125 










Val 


Val 
130 


Gly 


Gly 


Leu 


Leu 


Val 
135 


Ala 


Gly 


Met 


Thr 


He 
140 


Thr 


Thr 


Leu 


Phe 




Phe 


Phe 


He 


His 


Asn 


Ala 


Arg 


Ser 


Val 


Cys 


Gin 


Ser 


Ala 


Phe 


Pro 


Met 




145 










150 










155 










160 


30 


Ala 


Phe 


Trp 


Gly 


Trp 


Lys 


Ala 


Ser 


Gly 

















165 

(2) INFORMATION FOR SEQ ID NO: 13 8: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 87 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...487 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:138: 



Met He Glu Trp Met Gin Asn His Arg Lys Tyr Leu Val Val Thr He 
15 10 15 

55 Trp He Ser Thr He Ala Phe He Ala Ala Gly Met He Gly Trp Gly 
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20 25 30 

Gin Tyr Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gin 

35 40 45 

lie Lys He Ser Gin Glu Glu Leu Ala Gin Glu Tyr Arg Arg Leu Lys 
5 50 55 60 

Asp Ala Tyr Ala Glu Ser He Pro Asp Phe Lys Glu Leu Thr Glu Asp 
65 70 75 80 

Gin He Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu He 
85 90 95 

10 Asn Gin Ala Leu Leu Arg Asn Phe Ala Leu Asp Leu Gly Leu Gly Ala 
100 105 110 

Thr Lys Gin Glu Val Ala Lys Glu He Arg Lys Thr Asn Val Phe Gin 

lis 120 125 

Lys Asp Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn He Leu Lys Gin 
15 130 135 140 

Ser His Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu 
145 150 155 160 

He Leu Gin Lys He Ser Ala Leu Phe Pro Lys Thr Thr Thr Pro Leu 
165 170 175 

20 Glu Gin Ser Ser Leu Ser Leu Trp Ala Lys Leu Gin Asp Lys Leu Asp 
180 185 190 

He Leu He Leu Asn Pro Asn Asp Val Lys He Ser Leu Asn Glu Glu 

195 200 205 

Glu Met Lys Lys Tyr Tyr Glu Asn His Arg Lys Asp Phe Lys Lys Pro 
25 210 215 220 

Thr Ser Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys 
225 230 235 240 

Thr Asp Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser 
245 250 255 

30 Tyr Leu Asp Lys Glu Gly Lys Leu Gin Asp Phe Lys Ser Val Gin Glu 
260 265 270 

Gin Val Lys His Asp Leu Asn Met Gin Lys Ala Asn Glu Lys Ala Leu 

275 280 285 

Arg Ser Tyr He Ala Leu Lys Lys Gly Asn Ala Gin Asn Tyr Thr Thr 
35 290 295 300 

Gin Asp Phe Glu Lys Asn Asn Ser Pro Tyr Thr Ala Glu He Thr Gin 
305 310 315 320 

Lys Leu Thr Ala Leu Lys Pro Leu Glu Val Leu Lys Pro Glu Pro Phe 
325 330 335 

40 Lys Asp Gly Phe He Val Val Gin Leu Val Ser Gin He Lys Asp Glu 
340 345 350 

Leu Gin Asn Phe Asp Glu Ala Lys Ser Ala Leu Lys Thr Arg Leu Thr 

355 360 365 

Gin Glu Lys Thr Leu Met Ala Leu Gin Thr Leu Ala Lys Glu Lys Leu 
45 370 375 380 

Lys Asp Phe Lys Gly Lys Ser Val Gly Tyr Val Ser Pro Asn Phe Gly 
385 390 395 400 

Gly Thr He Ser Glu Leu Asn Gin Glu Glu Ser Ala Lys Phe He Asn 
405 410 415 

50 Thr Leu Phe Asn Arg Gin Glu Lys Lys Gly Phe Val Thr He Gly Asn 
420 425 430 

Lys Val Val Leu Tyr Gin He Thr Glu Gin Asn Phe Asn His Pro Phe 

435 440 445 

Ser Ala Glu Glu Asn Gin Tyr Met Gin Arg Leu Val Asn Asn Thr Lys 
55 450 455 460 
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Thr Asp Phe Phe Asp Lys Ala Leu lie Glu Glu Leu Lys Lys Arg Tyr 

465 470 475 480 

Lys lie Val Lys Tyr lie Gin 
485 

5 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

20 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...142 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



25 



30 



35 



40 



45 (2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



Met 


Lys 


Thr 


Asn 


Phe 


Tyr 


Lys 


He 


Lys 


Leu 


Leu 


Phe 


Ala 


Trp 


Cys 


Leu 


1 








5 










10 










15 




He 


He 


Gly 


Met 
20 


Phe 


Asn 


Ala 


Pro 


Leu 
25 


Asn 


Ala 


Asp 


Gin 


Asn 
30 


Thr 


Asp 


He 


Lys 


Asp 
35 


He 


Ser 


Pro 


Glu 


Asp 
40 


Met 


Ala 


Leu 


Asn 


Ser 
45 


Val 


Gly 


Leu 


Val 


Ser 
50 


Arg 


Asp 


Gin 


Leu 


Lys 
55 


He 


Glu 


He 


Pro 


Lys 
60 


Glu 


Thr 


Leu 


Glu 


Gin 


Lys 


Val 


Ala 


He 


Leu 


Asn 


Asp 


Tyr Asn 


Asp 


Lys 


Asn 


Val 


Asn 


He 


65 










70 










75 










80 


Lys 


Phe 


Asp 


Asp 


He 
85 


Ser 


Leu 


Gly 


Ser 


Phe 
90 


Gin 


Pro 


Asn 


Asp 


Asn 
95 


Leu 


Gly 


He 


Asn 


Ala 
100 


Met 


Trp 


Gly 


He 


Gin 
105 


Asn 


Leu 


Leu 


Met 


Ser 
110 


Gin 


Met 


Met 


Ser 


Asn 
115 


Tyr 


Gly 


Pro 


Asn 


Asn 
120 


Ser 


Phe 


Met 


Tyr 


Gly 
125 


Tyr 


Ala 


Pro 


Thr 


Tyr 
130 


Ser 


Asp 


Ser 


Ser 


Phe 
135 


Leu 


Pro 


Pro 


He 


Leu 
140 


Gly 


Tyr 







55 



(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...208 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140: 



Leu 


lie 


Asn 


Asn 


Asn 


Asn 


Asn 


Asn 


Lys 


Lys 


Leu 


Arg 


Gly 


Phe 


Phe 


Leu 


1 








5 










10 










15 




Lys 


Val 


Leu 


Leu 


Ser 


Leu 


Val 


Val 


Phe 


Ser 


Ser 


Tyr 


Gly 


Ser 


Ala 


Asn 






20 










25 










30 






Asp 


Asp 


Lvs 
35 


Glu 


Ala 


Lys 


Lys 


Glu 
40 


Ala 


Leu 


Glu 


Lys 


Glu 
45 


Lys 


Asn 


Thr 


Pro 


Asn 
50 


Gly 


Leu 


Val 


Tyr 


Thr 
55 


Asn 


Leu 


Asp 


Phe 


Asp 
60 


Ser 


Phe 


Lys 


Ala 


Thr 


He 


Lys 


Asn 


Leu 


Lys 


Asp 


Lys 


Lys 


Val 


Thr 


Phe 


Lys 


Glu 


Val 


Asn 


65 










70 










75 










80 


Pro 


Asp 


He 


He 


Lys 
85 


Asp 


Glu 


Val 


Phe 


Asp 
90 


Phe 


Val 


He 


Val 


Asn 
95 


Arg 


Val 


Leu 


Lys 


Lys 
100 


He 


Lys 


Asp 


Leu 


Lys 
105 


His 


Tyr 


Asp 


Pro 


Val 
110 


He 


Glu 


Lys 


He 


Phe 


Asp 


Glu 


Lys 


Gly 


Lys 


Glu 


Met 


Gly 


Leu 


Asn 


Val 


Glu 


Leu 




115 










120 










125 








Gin 


He 
130 


Asn 


Pro 


Glu 


Val 


Lys 
135 


Asp 


Phe 


Phe 


Thr 


Phe 
140 


Lys 


Ser 


He 


Ser 


Thr 


Thr 


Asn 


Lys 


Gin 


Arg 


Cys 


Phe 


Leu 


Ser 


Leu 


His 


Gly 


Glu 


Thr 


Arg 


145 










150 










155 










160 


Glu 


He 


Leu 


Cys 


Asp 
165 


Asp 


Lys 


Leu 


Tyr 


Asn 
170 


Val 


Leu 


Leu 


Ala 


Val 
175 


Phe 


Asn 


Ser 


Tyr 


Asp 
180 


Pro 


Asn 


Asp 


Leu 


Leu 
185 


Lys 


His 


He 


Ser 


Thr 
190 


He 


Glu 


Ser 


Leu 


Lys 
195 


Lys 


He 


Phe 


Tyr 


Thr 
200 


He 


Thr 


Cys 


Glu 


Ala 
205 


Val 


Tyr 


Leu 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1.,.245 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
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Met Ala Gly Thr Gin Ala lie Tyr Glu Ser Ser Ser Ala Gly Phe Leu 
15 10 15 

Ser Gin Val Ser Ser lie lie Ser Ser Thr Ser Gly Val Ala Gly Pro 
5 20 25 30 

Phe Ala Gly lie Val Ala Gly Ala Met Thr Ala Ala lie lie Pro lie 

35 40 45 

Val Val Gly Phe Thr Asn Pro Gin Met Thr Ala lie Met Thr Gin Tyr 
50 55 60 

10 Asn Gin Ser lie Ala Glu Ala Val Ser Val Pro Met Lys Ala Ala Asn 
65 70 75 80 

Gin Gin Tyr Asn Gin Leu Tyr Gin Gly Phe Asn Asp Gin Ser Met Ala 

85 90 95 

Val Gly Asn Asn lie Leu Asn lie Ser Lys Leu Thr Gly Glu Phe Asn 
15 100 105 110 _^ 

Ala Gin: Gly "Asn Thr Gin Ser^'Ala'Gln lie Ser Ala Val Asn Ser Gin 

115 120 125 

lie Ala Ser lie Leu Ala Ser Asn Thr Thr Pro Lys Asn Pro Ser Ala 
130 135 140 

20 lie Glu Ala Tyr Ala Thr Asn Gin lie Ala Val Pro Ser Val Pro Thr 
145 150 155 160 

Thr Val Glu Met Met Ser Gly lie Leu Gly Asn lie Thr Ser Ala Ala 

165 170 175 

Pro Lys Tyr Ala Leu Ala Leu Gin Glu Gin Leu Arg Ser Gin Ala Ser 
25 180 IBS 190 

Asn Ser Ser Met Asn Asp Thr Ala Asp Ser Leu Asp Ser Cys Thr Ala 

195 200 205 

Leu Gly Ala Leu Val Gly Ser Ser Lys Val Phe Phe Ser Cys Met Gin 
210 215 220 

30 He Ser Met Thr Pro Met Ser Val Ser Met Pro Thr Val Met Pro Asn 
225 230 235 240 

Thr Ser Gly Cys His 
245 

35 (2) INFORMATION FOR SEQ ID NO:142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 67 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{ i i i ) HYPOTHETI CAL : YES 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
50 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...367 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 142 ; 
Met lie Lys Ser Val Glu lie Glu Asn Tyr Lys Asn Phe Glu His Leu 
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Lys Met Glu Asn Phe 
20 




10 

Lys Leu lie Asn Phe Phe 
25 



Thr Gly Gin 
30 



15 
Asn 



Asp 



Ala Gly Lys Thr Asn Leu Leu Glu Ala Leu Tyr Thr Asn Thr Gly Leu 

35 40 45 

Cys Asp Pro Thr Ala Asn Gin Val Ser Leu Pro Pro Glu His Ala Val 

50 55 60 

Asn lie Ser Glu Phe Arg Lys lie Lys Leu Asp Ala Asp Asn Leu Lys 



10 Thr Phe Phe Tyr Gin Gly Asn Thr Ala Asn Pro lie Ser lie Arg Thr 

85 90 95 

Glu Phe Glu His Ala Thr lie Pro Leu Thr lie Gin Tyr Pro Thr Gin 

100 105 110 

Thr Ser Tyr Ser Lys Asp lie Asn Leu Asn Ser Asp Asp Ala His Met 
15 115 120 125 

Thr Asn Leu lie Asn Thr Thr lie Thr Lys Pro Gin Leu Gin Phe Ser 

130 135 140 

Tyr Asn Pro Ser Leu Ser Pro Met Thr Met Thr Tyr Glu Phe Glu Arg 
145 150 155 160 

20 Gin Asn Leu Gly Leu lie His Ser Asn Leu Asp Lys lie Ala Gin Thr 

165 170 175 

Tyr Lys Glu Asn Ala Met Phe lie Pro He Glu Leu Ser He Val Asn 

180 185 190 

Ser Leu Lys Ala Leu Glu Asn Leu Gin Leu Ala Ser Lys Glu Lys Glu 
25 195 200 205 

Leu He Glu He Leu Gin Cys Phe Asn Pro Asn He Leu Asn Ala Asn 

210 215 220 

Thr He Arg Lys Ser Val Tyr He Gin He Lys Asp Glu Asn Thr Pro 
225 230 235 240 

30 Leu Glu Glu Ser Pro Lys Arg Leu Leu Asn Leu Phe Gly Trp Gly Phe 

245 250 255 

He Lys Phe Phe He Met Val Ser He Leu He Asp Asn Arg Val Lys 

260 265 270 

Tyr Leu Phe He Asp Glu He Glu Ser Gly Leu His His Thr Lys Met 
35 275 280 285 

Gin Glu Phe Leu Lys Ala Leu Phe Lys Leu Ala Gin Lys Leu Gin He 

290 295 300 

Gin He Phe Ala Thr Thr His Asn Lys Glu Phe Leu Leu Asn Ala He 
305 310 315 320 

40 Asn Thr He Ser Asp Asn Glu Thr Gly Val Phe Lys Asp He Ala Leu 

325 330 335 

Phe Glu Leu Glu Lys Glu Ser Ala Ser Gly Phe He Arg His Ser Tyr 

340 345 350 

Ser Met Leu Glu Lys Ala Leu Tyr Arg Gly Met Glu Val Arg Gly 
45 355 360 365 

(2) INFORMATION FOR SEQ ID NO:143: 



65 



70 



75 



80 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



55 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

5 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .409 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 

Met Ser Leu lie Arg Val Asn Gly Glu Ala Phe Lys Leu Ser Leu Glu 

15 10 15 

Ser Leu Glu Glu Asp Pro Phe Glu Thr Lys Glu Thr Leu Glu Thr Leu 

15 20^ 25 30 

~ Glu Thr Leu lie Lys Gin Thr Ser Val Val Leu Leu Ala Ala Gly Glu 
35 40 45 

Ser Lys Arg Phe Ser Arg Ala lie Lys Lys Gin Trp Leu Arg Ser His 
50 55 60 

20 His Thr Pro Leu Trp Leu Ser Val Tyr Glu Ser Phe Lys Glu Ala Leu 
65 70 75 80 

Asp Phe Lys Glu Val lie Leu Val Val Ser Glu Leu Asp Tyr Val Tyr 

85 90 95 

lie Gin Arg His Tyr Pro Lys lie Lys Leu Val Lys Gly Gly Ala Ser 
25 100 105 110 

Arg Gin Glu Ser Val Arg Asn Ala Leu Lys Val lie Asp Ser Thr Tyr 

115 120 125 

Thr lie Thr Ser Asp Val Ala Arg Gly Leu Ala Asn Met Glu Ala Leu 
130 135 140 

30 Lys Ser Leu Phe Leu Thr Leu Gin Gin Thr Ser His Tyr Cys lie Ala 
145 150 155 160 

Pro Tyr Leu Pro Cys Tyr Asp Thr Ala lie Tyr Tyr Asn Glu Ala Leu 

165 170 175 

Asp Arg Glu Ala lie Lys Leu lie Gin Thr Pro Gin Leu Ser His Thr 
35 180 185 190 

Lys Thr Leu Gin Ser Ala Leu Asn Gin Gly Gly Phe Lys Asp Glu Ser 

195 200 205 

Ser Ala lie Leu Gin Ala Phe Pro Asn Ser Val Ser Tyr lie Glu Gly 
210 215 220 

40 Ser Lys Asp Leu His Lys Leu Thr Thr Ser Gly Asp Leu Lys Phe Phe 
225 230 235 240 

Thr Pro Phe Phe Asn Pro Ala Lys Asp Thr Phe lie Gly Met Gly Phe 

245 250 255 

Asp Thr His Ala Phe lie Lys Asp Lys Pro Met Val Leu Gly Gly Val 
45 260 265 270 

Val Leu Asp Cys Glu Phe Gly Leu Lys Ala His Ser Asp Gly Asp Ala 

275 280 285 

Leu Leu His Ala Val lie Asp Ala lie Leu Gly Ala lie Lys Gly Gly 
290 295 300 

50 Asp lie Gly Glu Trp Phe Pro Asp Asn Asp Pro Lys Tyr Lys Asn Ala 
305 310 315 320 

Ser Ser Lys Glu Leu Leu Lys lie Val Leu Asp Phe Ser Gin Ser lie 

325 330 335 

Gly Phe Glu Leu Leu Glu Met Gly Ala Thr lie Phe Ser Glu lie Pro 
55 340 345 350 
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Lys lie Thr Pro Tyr Lys Pro Ala He Leu Glu Asn Leu Ser Gin Leu 

355 360 365 

Leu Gly Leu Glu Lys Ser Gin He Ser Leu Lys Ala Thr Thr Met Glu 
370 375 380 

5 Lys Met Gly Phe He Gly Lys Gin Glu Gly Leu Leu Val Gin Ala His 
385 390 395 400 

Val Ser Met Arg Tyr Lys Gin Lys Leu 
405 

10 (2) INFORMATION FOR SEQ ID NO : 144 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 0 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{ iii ) HYPOTHETICAL : YES 



20 



30 



35 



40 



45 



50 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
25 (A) NAME/KEY: misc_feature 

(B) LOCATION 1,..270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: 



Met 


Lys 


Lys 


Phe 


Val 


Ala 


Leu 


Gly 


Leu 


Leu 


Ser 


Ala 


Val 


Leu 


Ser 


Ser 


1 




5 










10 










15 




Ser 


Leu 


Leu 


Ala 
20 


Glu 


Gly 


Asp 


Gly 


Val 
25 


Tyr 


He 


Gly 


Thr 


Asn 
30 


Tyr 


Gin 


Leu 


Gly 


Gin 


Ala 


Arg 


Leu 


Asn 


Ser 


Asn 


He 


Tyr 


Asn 


Thr 


Gly 


Asp 


Cys 




35 










40 










45 






His 


Thr 


Gly 
50 


Ser 


Val 


Val 


Gly 


Cys 
55 


Pro 


Pro 


Gly 


Leu 


Thr 
60 


Ala 


Asn 


Lys 


Asn 


Pro 


Gly 


Gly 


Thr 


Asn 


He 


Asn 


Trp 


His 


Ser 


Lys 


Tyr 


Ala Asn Gly 


65 








70 










75 










80 


Ala 


Leu 


Asn 


Gly 


Phe 


Gly 


Leu 


Asn 


Val 


Gly 


Tyr 


Lys 


Lys 


Phe 


Phe 


Gin 








85 










90 










95 




Phe 


Lys 


Ser 


Leu 


Asp 


Met 


Thr 


Ser 


Lys 


Trp 


Phe 


Gly 


Phe 


Arg 


Val 


Tyr 






100 










105 










110 






Gly 


Leu 


Phe 


Asp 


Tyr 


Gly 


His 


Ala 


Asp 


Leu 


Gly 


Lys 


Gin 


Val 


Tyr 


Ala 




115 










120 










125 








Pro 


Asn 


Lys 


He 


Gin 


Leu 


Asp 


Met 


Val 


Ser 


Trp 


Gly 


Val 


Gly 


Ser 


Asp 




130 








135 










140 








Phe 


Leu 


Leu 


Ala 


Asp 


He 


He 


Asp 


Lys 


Asp 


Asn 


Ala 


Ser 


Phe 


Gly 


He 


145 








150 










155 










160 


Gly 


Gly 


Val 


Ala 


He 


Gly 


Gly 


Asn 


Thr 


Trp 


Lys 


Ser 


Ser 


Ala 


Ala 


Asn 






165 










170 










175 




Tyr 


Trp 


Lys 


Glu 


Gin 


He 


He 


Glu 


Ala 


Lys 


Gly 


Pro 


Asp 


Val 


Cys 


Thr 




180 










185 










190 






Pro 


Thr 


Tyr 
195 


Cys 


Asn 


Pro 


Asn 


Ala 
200 


Pro 


Tyr 


Ser 


Thr 


Asn 
205 


Thr 


Ser 


Thr 
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Val Ala Phe Gin Val Trp Leu Asia Phe Gly Val Arg Ala Asn lie Tyr 

210 215 220 

Lys His Asn Gly Val Glu Phe Gly Val Arg Val Pro Leu Leu lie Asn 
225 230 235 240 

5 Lys Phe Leu Ser Ala Gly Pro Asn Ala Thr Asn Leu Tyr Tyr His Leu 

245 250 255 

Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr Phe 
260 265 270 

10 (2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 38 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
25 (A) NAME/KEY: misc_feature 

(B) LOCATION 1...438 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:145: 

30 Met Ala Tyr Lys Pro Asn Lys Lys Lys Leu Lys Glu Leu Arg Glu Gin 
15 10 15 

Pro Asn Leu Phe Ser lie Leu Asp Lys Gly Asp Val Ala Thr Asn Asn 

20 25 30 

Pro Val Glu Glu Ser Asp Lys Ala Asn Lys lie Gin Glu Pro Leu Pro 
35 35 40 45 

Tyr Val Val Lys Thr Gin lie Asn Lys Ala Ser Met lie Ser Arg Asp 

50 55 60 

Pro lie Glu Trp Ala Lys Tyr Leu Ser Phe Glu Lys Arg Val Tyr Lys 
65 70 75 80 

40 Asp Asn Ser Lys Glu Asp Val Asn Phe Phe Ala Asn Gly Glu lie Lys 

85 90 95 

Glu Ser Ser Arg Val Tyr Glu Ala Asn Lys Glu Gly Phe Glu Arg Arg 

100 105 110 

lie Thr Lys Arg Tyr Asp Leu lie Asp Arg Asn lie Asp Arg Asn Arg 
45 115 120 125 

Glu Phe Phe lie Lys Glu lie Glu lie Leu Thr His Thr Asn Ser Leu 

130 135 140 

Lys Glu Leu Lys Glu Gin Gly Leu Glu lie Gin Leu Thr His His Asn 
145 150 155 160 

50 Glu Thr His Lys Lys Ala Leu Glu Asn Gly Asn Glu lie Val Lys Glu 

165 170 175 

Tyr Asp His Leu Lys Asp lie Tyr Gin Glu Val Glu Arg Thr Lys Asp 

180 185 190 

Gly Gly Leu Val Arg Glu lie lie Pro Ser lie Ser Ser Ala Glu Tyr 
55 195 200 205 
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10 



15 



20 



25 



30 



40 



45 



Phe 


Lys 
210 


Leu 


Tyr 


Asn 


Lys 


Leu 
215 


Pro 


Phe 


Glu 


Ser 


He 
22 0 


Asn 


Asn 


L3J.U 


Asn 


Thr 


Lys 


Leu 


Asn 


Thr 


Asn 


Asp 


Asn 


Glu 


Glu 


Val 


Lys 


Lys 


Leu 


Glu 


Phe 


225 








230 










235 












Glu 


Leu 


Ala 


Lys 


Glu 


Val 


His 


He 


Leu 


He 


Leu 


Glu 


Gin 


Gin 


Leu 


Leu 






245 










250 










0 c c 
25b 




Ser 


Ala 


Thr 


Asn 


Tyr 


Tyr 


Ser 


Trp 


He 


Asp 


Lys 


Asp 


Asp 


Asn 


Ala 


Asn 






260 










265 










270 






Phe 


Ala 


Trp 
275 


Lys 


Met 


His 


Arg 


Leu 
280 


He 


Asn 


Glu 


Asn 


Lys 
2 8 5 


Leu 


Lys 


Glu 


Asn 


His 


Leu 


Ser 


Ala 


Asn 


Asn 


Ala 


Asn 


Lys 


He 


Lys 


Gin 


Phe 


Pne 


Pne 


290 










295 










300 










Asn 


Asn 


Gly 


Ser 


He 


Leu 


Gly 


Trp 


Thr 


Lys 


Glu 


Glu 


Gin 


Ser 


Ala 


He 


305 








310 










315 










-y ^ r\ 


Gin 


Glu 


Asn 


Arg 


Asp 


Tyr 


Ser 


Leu 


Arg 


Ser 


Ala 


Leu 


Leu 


Ser 


Leu 


Glu 






325 










330 










^ "a c 
335 




Glu 


He 


Ala 


Gin 


Ala 


Lys 


He 


Glu 


Leu 


Gin 


Lys 


Tyr 


Tyr 


Glu 


Ser 


Val 






340 










345 










350 




V3-L U 


Tyr 


Val 


Asn 


Gly 


Asp 


Gly 


Asn 


Lys 


Arg 


Glu 


He 


Lys 


Pro 


Pne 


Lys 




355 










360 










365 








He 


Leu 


Arg 


Asp 


Thr 


Asn 


Asn 


Phe 


Glu 


Lys 


Ala 


Tyr 


Lys 


Glu 


Arg 


Tyr 




370 








375 










380 








Gly 


Asp 


Lys 


Leu 


Val 


Ser 


Leu 


Ser 


Ala 


Ala 


He 


He 


Gin 


Ala 


Lys 


Glu 


385 








390 










395 








He 


400 


Gly 


Asn 


Glu 


Arg 


Pro 


Asn 


Ser 


Ser 


Ala 


Asn 


Asn 


Asn 


Asn 


Pro 


Lys 








405 










410 










415 




Asn 


Thr 


He 


Glu 


Thr 


Asn 


Thr 


Ser 


Asn 


Asn 


He 


He 


Gin 


Asn 


Asn 


Asp 






420 










425 










430 






Asn 


He 


He 
435 


He 


Gin 


He 























(2) INFORMATION FOR SEQ ID NO: 146: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...215 

50 (3ci) SEQUENCE DESCRIPTION: SEQ ID NO:146: 

Met Gin Ala Leu Lys Ser Leu Leu Glu Val He Thr Lys Leu Gin Asn 
1 5 10 15 

Leu Gly Gly Tyr Leu Met His He Ala He Phe He He Phe He Trp 

55 20 25 30 
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Leu 


Lys 




1 

V cLX 


Pro 


Tyr 


Glu 




Glu Gly 


X X c 


Ala 


Pro 








35 










40 










*± 3 










Pne 


V3L± 


A J. a 


Asn. 


Ser 


Pro 


XT lit; 
55 




Ser 


Phe 






Lys 




u J. U 


Lys 


c 

J 


Pro 
65 


AX a 


Tyr 


Lys 




rljL S 
/ u 


Lys 




Ser 


Glu 


Ser 

/ 3 




Ser 


Mot* 


m n 


fi n 
0 \j 




GlU 


Met 


Gin 


Asp 


Asn. 
85 


Pro 


Lys 




Val 
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25 

(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1., . ,20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 7: 



TATACCATGG TGGGCGCTAA 



(2) INFORMATION FOR SEQ ID NO: 148: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

( ii ) MOLECITLE TYPE : DNA (genomic) 

( iii) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOXmCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc__f eature 

(B) LOCATION 1...23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:148: 
ATGAATTCGA GTAAGGATTT TTG 
(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 9; 
TTAACCATGG TGAAAAGCGA TA 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 



23 



22 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

5 (vi) ORIGINAL SOtlRCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
10 (B) LOCATION l..,23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TAGAATTCGC ATAACGATCA ATC 

-( 2r -INFORMATION~FOR- SEQ ID~NOTr5T^ 



15 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



25 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
35 (B) LOCATION 1...22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

ATATCCATGG TGAGTTTGAT GA 

(2) INFORMATION FOR SEQ ID NO: 152: 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
45 (B) TYPE: nucljeic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



50 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

55 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: inisc_f eature 

(B) LOCATION 1. . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l52 
ATGAATTCAA TTTTTTATTT TGCCA 
(2) INFORMATION FOR SEQ ID NO:153: 



(i) SEQUENCE CHARACTERISTICS: 
(A) liENGTH: 21 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 

( i i i ) H YPOTHET I C AL : NO 

(iv) ANTI- SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: mi sc__f eature 
30 (B) LOCATION 1...21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 

AATTCCATGG TGGGGGCTAT G 

(2) INFORMATION FOR SEQ ID NO:154: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME/KEY : misc_f eature 
55 (B) LOCATION 1 ... 23 



25 



21 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:154; 
ATGAATTCTC GATAGCCAAA ATC 

5 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : c ir cul ar 



15 



(ii) MOLECULE TYPE: DNA (genomic) 

(-i-i-i-)- HY-POTKETIGAL : NO 

(iv) ANTI- SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
25 (B) LOCATION 1.,.25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

AATTCCATGG TGCATAACTT CCATT 

(2) INFORMATION FOR SEQ ID NO: 156: 



30 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



40 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
50 (B) LOCATION 1...25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

AAGAATTCTC TAGCATCCAA ATGGA 

55 
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(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
20 (B) LOCATION 1. . .24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 

ATTTCCATGG TCATGTCTCA TATT 

(2) INFORMATION FOR SEQ ID NO: 158: 



(i) SEQUENCE CJIARACTERISTICS : 
(A) LENGTH: 23 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 7 base pairs 
55 (B) TYPE: nucleic acid 



24 



40 (vi) ORIGINAL SOURCE; 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
45 (B) LOCATION 1, . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l58: 

ATGAATTCCA TCTTTTATTC CAC 

(2) INFORMATION FOR SEQ ID NO: 159: 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOIiECUIiE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

{ ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 
15 (B) LOCATION 1. . .27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:159: 

AACCATGGTG ATTTTAAGCA TTGAAAG 

(2) INFORMATION FOR SEQ ID NO: 160: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 



30 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
40 (B) LOCATION 1...28 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 0: 

AAGAATTCCA CTCAAAATTT TTTAACAG 

(2) INFORMATION FOR SEQ ID NO: 161: 



45 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAIj: NO 

(iv) ANTI- SENSE : NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATORE: 

(A) NAME/KEY: misc_f eature 
10 (B) LOCATION 1...25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GATCATCCAT ATGTTATCTT CTAAT 25 

(2) INFORMATION FOR SEQ ID NO: 162: 



15 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



25 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
35 (B) LOCATION 1.,.23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: 

TGAATTCAAC CATTTTAACC CTG 23 

(2) INFORMATION FOR SEQ ID NO: 16 3: 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 7 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



50 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

55 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATXmE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163; 
TATACCATGG TGAAATTTTT TCTTTTA 
(2) INFORMATION FOR SEQ ID NO: 164: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
15 (B) TYPE: nucleic acid 

" ' ' Kcr STRANDEDNESS : double" 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
30 (B) LOCATION 1...2S 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: 

AGAATTCAAT TGCGTCTTGT AAAAG 

(2) INFORMATION FOR SEQ ID NO: 165: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
55 (B) LOCATION 1...24 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 
TATACCATGG TGATGGACAA ACTC 24 

5 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



15 



30 



40 



(ii) MOLECULE TYPE: DNA (genomic) 
( ii i ) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 



20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
25 (B) LOCATION 1...23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

ATGAATTCCC ACTTGGGGCG ATA 23 

(2) INFORMATION FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

. (ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
50 (B) LOCATION 1...25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 

TTATGGATCC AAACCAATTA AAACT 25 

55 
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(2) INFORMATION FOR SEQ ID NO:168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

( A)"ORGANrSMT"HelxcoSacter pyloFx 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
20 (B) LOCATION 1...23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 8: 

TATCTCGAGT TATAGAGAAG GGC 23 

(2) INFORMATION FOR SEQ ID NO: 169: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



35 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
45 (B) LOCATION 1... .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

TTAACCATGG TGAAAAGCGA TA 22 

(2) INFORMATION FOR SEQ ID NO: 17 0: 



50 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
55 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
15 (B) LOCATION 1...24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

TAGAATTCGC CTCTAAAACT TTAG 

(2) INFORMATION FOR SEQ ID NO: 171: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



30 



45 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE : NO 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



55 



(ii) MOLECULE TYPE: DNA (genomic) 



24 



35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
40 (B) LOCATION 1...22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

TTAACCATGG TGAAAAGCGA TA 

(2) INFORMATION FOR SEQ ID NO: 172: 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: inisc_f eature 
10 (B) LOCATION 1-..23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:172; 

TAGAATTCGC ATAACGATCA ATC 

15 



25 



40 



50 



(2T INFORMATION FOR SEQ ID NO: 173: 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 22 base pairs 
20 (B) TYPE: nucleic acid 

( C ) S TRANDEDNES S : doubl e 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL; NO 

(iv) ANTI -SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc^feature 
35 (B) LOCATION 1...22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

ATATCCATGG TGAGTTTGAT GA 

(2) INFORMATION FOR SEQ ID NO: 174: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

55 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:174 
ATGAATTCAA TTTTTTATTT TGCCA 
(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 
( iii ) HYPOTHETICAL : NO 
(iv) ANTI- SENSE: NO 



25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
30 (B) LOCATION 1...23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l75: 

AATTCCATGG CTATCCAAAT CCG 

(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
55 (B) LOCATION 1...25 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:176: 
ATGAATTCGC CAAAATCGTA GTATT 

5 

(2) INFORIVIATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



15 



30 



40 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
25 (B) LOCATION 1...24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

GATACCATGG AATTTATGAA AAAG 

(2) INFORMATION FOR SEQ ID NO: 178: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE:. 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
50 (B) LOCATION 1...2S 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:178: 

TGAATTCGAA AAAGTGTAGT TATAC 

55 
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(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
20 (B) LOCATION 1. . .19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: 
CCCTTCATTT TAGAAATCG 

(2) INFORMATION FOR SEQ ID NO:1BO: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 



35 



50 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
55 (B) TYPE: nucleic acid 



19 



(ii) MOLECULE TYPE: DNA (genomic) 

( iii ) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
45 (B) LOCATION 1.,.20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:180: 

20 

ATTTCAACCA ATTCAATGCG 
(2) INFORMATION FOR SEQ ID NO: 181: 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECUXiE TYPE: DNA (genomic) 

5 

( i i i ) HYPOTHET I CAL : NO 

(iv) ANTI- SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
15^ _ (B) LOCATION 1 . . .20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

GCCCCTTTTG ATTTGAAGCT 20 

(2) INFORMATION FOR SEQ ID NO: 182: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



30 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
40 (B) LOCATION 1...22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

TCGCTCCAAG ATACCAAGAA GT 22 

(2) INFORMATION FOR SEQ ID NO: 183: 



45 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



55 
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(iii) HYPOTHETICAL: NO 

(iv) AMTI- SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 
10 (B) LOCATION 1...22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 

CTTGAATTAG GGGCAAAGAT CG 22 

(2) INFORMATION FOR SEQ ID NO:184: 



15 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: circular 



25 



40 



50 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
35 (B) LOCATION 1. . ,22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

ATGCGTTTTT ACCCAAAGAA GT 22 

(2) INFORMATION FOR SEQ ID NO:185: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

55 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1 . . .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 5: 
ATAACGCCAC TTCCTTATTG GT 22 
(2) INFORMATION FOR SEQ ID NO: 186: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
1.5 (B) TYPE: nucleic a cid , 

(C) STRANDEDNESS : double 
{ D ) TOPOLOGY : ci r cular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
30 (B) LOCATION l..,19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 6: 

CTTTGGGTAA AAACGCATC 

(2) INFORMATION FOR SEQ ID NO: 187: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
55 (B) LOCATION 1...20 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
CGATCTTTGA TCCTAATTCA 20 

5 

(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LiENGTH: 19 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: circular 



15 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
25 (B) LOCATION 1...19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:188; 

ATCAAGTTGC CTATGCTGA 

(2) INFORMATION FOR SEQ ID NO: 189: 



30 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



40 



19 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
50 (B) LOCATION 1.,.22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

TTGAACACTT TTGATTATGC GG 22 

55 
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(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 3 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: circular 



10 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(A) ORGT^ISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
20 (B) LOCATION 1...23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190; 

GGATTATGCG ATTGTTTTAC AAG 

(2) INFORMATION FOR SEQ ID NO: 191: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



35 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

( ix ) FEATURE : 

(A) NAME/KEY: misc_feature 
45 (B) LOCATION 1...21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

GTCTTTAGCA AAAATGGCGT C 

50 

(2) INFORMATION FOR SEQ ID NO: 192: 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

( iii ) HYPOTHETICAL : NO 

( iv) ANTI - SENSE : NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
15 (B) LOCATION 1...21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:192: 

AATGAGCGTA AGAGAGCCTT C 

(2) INFORMATION FOR SEQ ID NO:193: 



20 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 18 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



30 



45 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
40 (B) LOCATION 1 ... 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 

CTTATGGGGG TATTGTCA 

(2) INFORMATION FOR SEQ ID NO:194: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



55 



(ii) MOLECULE TYPE: DNA (genomic) 



18 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE; 

(A) NAME/KEY : misc_f eature 
10 (B) LOCATION 1. . .18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:194: 

AGCATGTGGG TATCCAGC 

15 



(2) INFORMATION FOR SEQ ID NO:195: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



25 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
35 (B) LOCATION 1...19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:195: 

AGGTTGTTGC CTAAAGACT 

(2) INFORMATION FOR SEQ ID NO: 196: 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



50 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
55 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:196 
CTGCCTCCAC CTTTGATC 

(2) INFORMATION FOR SEQ ID NO: 197: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : douible 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
30 (B) LOCATION 1...19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:197 
ACCAATATCA ATTGGCACT 

(2)' INFORMATION FOR SEQ ID NO:198: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
40 (B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 
55 (B) LOCATION 1...18 



18 



19 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:198: 
ACTTGGAAAA GCTCTGCA 

5 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
10 (B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: circular 



15 



(ii) MOLECULE TYPE: DNA (genomic) 

(-i-ii ) HYPOTHETICAL-: ^NO 

(iv) ANTI- SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix> FEATURE: 

(A) NAME/KEY: misc_feature 
25 (B) LOCATION 1.,.19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

CTTGCTTGTC ATATCTAGC 

(2) INFORMATION FOR SEQ ID NO: 200: 



30 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 



40 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
50 (B) LOCATION 1...18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

GTTGAAGTGT TGGTGCTA 

55 



wo 98/18323 



PCT/US97/19575 



10 



252 



(2) INFORMATION FOR SEQ ID NO: 2 01 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D ) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 
20 (B) LOCATION 1...22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 
CAAGCAAGTG GTTTGGTTTT AG 

25 

(2) INFORMATION FOR SEQ ID N0:202: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

35 

( iii ) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
45 (B) LOCATION 1,..22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 
TGGAAAGAGC AAATCATTGA AG 

50 

(2) INFORMATION FOR SEQ ID NO:203: 



(i) 



30 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDBDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

10 (vi) ORIGINJU. SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
15 (B) LOCATION 1. ..21 _ , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203; 

GCCCATAATC AAAAAGCCCA T 

(2) INFORMATION FOR SEQ ID NO:204: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
25 (B) TYPE: nucleic acid 

( C ) S TRANDEDNES S : doubl e 

(D) TOPOLOGY: circular 



30 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE; 

( A ) NAME / KEY : mis c_f e a tur e 
40 (B) LOCATION 1...24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 

CTAAAACCAA ACCACTTGCT TGTC 

(2) INFORMATION FOR SEQ ID NO: 2 05: 



45 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
50 (B) TYPE: nucleic acid 

(C) S TRANDEDNES S : double 

(D) TOPOLOGY: circular 



55 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
10 (B) LOCATION 1...16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 
GTAAAACGAC GGCCAG 

(2) INFORMATION FOR SEQ ID NO:206: 



15 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



25 



40 



50 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
( iv) ANTI - SENSE : NO 



30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME/KEY: misc_f eature 
35 (B) LOCATION 1...17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 
CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 207: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



55 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1..-21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 
ATCTTACCTA TCACCTCTUUl T 
(2) INFORMATION FOR SEQ ID NO:208: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 
30 (B) LOCATION 1...21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

AGACAGCAAC ATCTTTGTGA A 
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1 . An isolated nucleic acid comprising a nucleotide sequence encoding an 
5 H, pylori polypeptide at least about 60% homologous to an amino acid sequence 

selected from the group consisting of SEQ ID NO: 74-SEQ ID NO: 146. 

2. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H, pylori polypeptide selected from the group consisting of SEQ ID NO: 74-SEQ ID 

10 NO: 146. 

3 . An isolated nucleic acid which encodes an K pylori polypeptide, 
comprising a nucleotide sequence at least about 60% homologous to a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 73, or a 

15 complement thereof. 

4. The isolated nucleic acid of claim 1 , comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 73, or a complement 
thereof. 



20 



25 



30 



5. An isolated nucleic acid molecule encoding an K pylori polypeptide, 
comprising a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule comprising the nucleotide sequence selected from 
the group consisting of SEQ ID NO: 1-SEQ ID NO: 73, or a complement thereof. 

6. An isolated nucleic acid comprising a nucleotide sequence of at least 8 
nucleotides in length, wherein the sequence hybridizes under stringent hybridization 
conditions to a nucleic acid having a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-SEQ ID NO: 73, or a complement thereof. 



7. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H, pylori cell envelope polypeptide or a fragment thereof, said nucleic acid selected 
from the group consisting of SEQ ID NO: 3, SEQ ID NO: 25, SEQ ID NO: 48, SEQ ID 
NO: 16, SEQ ID NO: 10, SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID 
35 NO: 7, SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID 
NO: 28, SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, SEQ ID 
NO: 58, SEQ ID NO: 1, SEQ ID NO: 42, SEQ ID NO: 14, SEQ ID NO: 43, SEQ ID 



wo 98/18323 



PCT/US97/19575 



- 257 - 

NO: 1 1, SEQ ID NO: 71, SEQ ID NO: 17, SEQ ID NO: 57, SEQ ID NO: 5, SEQ ID 
NO: 6, SEQ ID NO: 8, and SEQ ID NO: 21, or a complement thereof. 

8. The isolated nucleic acid of claim 7, wherein said //. pylori cell envelope 
5 polypeptide or a fragment thereof is an K pylori inner membrane polypeptide or a 

fragment thereof encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 3, SEQ ID NO: 25, and SEQ ID NO: 48, or a complement thereof. 

9. The isolated nucleic acid of claim 7, wherein said K pylori cell envelope 
1 0 polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a 

fragment thereof encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 16, SEQ ID NO: 10, SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID NO: 37, SEQ 
ID NO: 7, SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 18, SEQ ID NO: 19, SEQ IE) 
NO: 28, SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, SEQ ID 
15 NO: 58, SEQ ID NO: 1, SEQ ID NO: 42, SEQ ID NO: 14, SEQ ID NO: 43, SEQ ID 
NO: 1 1 , and SEQ ID NO: 71 , or a complement thereof. 



1 0. The isolated nucleic acid of claim 9, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H, pylori polypeptide having a 
20 terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
encoded by a nucleic acid selected from the group consisting of SEQ ID NO: 1, SEQ ID 
NO: 42, SEQ ID NO: 14, SEQ ID NO: 43, SEQ ID NO: 1 1 and SEQ ID NO:71, or a 
complement thereof. 



25 11. The isolated nucleic acid of claim 9, wherein said K pylori outer 

membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof encoded by a nucleic acid selected 
from the group consisting of SEQ ID NO: 16, SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID 
NO: 37, SEQ ID NO: 7, SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 18, SEQ ID 

30 NO: 19, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID 
NO: 56, SEQ ID NO: 58, or a complement thereof. 



12. An isolated nucleic acid comprising a nucleotide sequence encoding an 
K pylori cell envelope polypeptide or a fragment thereof selected from the group 
35 consisting of SEQ ID NO: 76, SEQ ID NO: 98, SEQ ID NO: 121, SEQ ID NO: 89, SEQ 
ID NO: 83, SEQ ID NO: 118, SEQ ID NO: 108, SEQ ID NO: 1 10, SEQ ID NO: 80, 
SEQ ID NO: 1 12, SEQ ID NO: 128, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 
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101, SEQ ID NO: 103, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID 
NO: 131, SEQ ID NO: 74, SEQ ID NO: 115, SEQ ID NO: 87. SEQ ID NO: 116, SEQ 
ID NO: 84, SEQ ID NO: 144, SEQ ID NO: 90, SEQ ID NO: 130, SEQ ID NO: 78, SEQ 
ID NO: 79, SEQ ID NO: 81, and SEQ ID NO: 94. 

5 

13. The isolated nucleic acid of claim 12, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 76, SEQ ID 
NO: 98, and SEQ ID NO: 121. 

10 

14. The isolated nucleic acid of claim 12, wherein said H. pylori cell 
envelope polypeptide or, a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 89, SEQ ID 
NO: 83, SEQ ID NO: 118, SEQ ID NO: 108, SEQ ID NO: 1 10, SEQ ID NO: 80, SEQ 

15 ID NO: 112, SEQ ID NO: 128, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 101, 
SEQ ID NO: 103, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 
131, SEQ ID NO: 74, SEQ ID NO: 115, SEQ ID NO: 87, SEQ ID NO: 1 16, SEQ ID 
NO: 84, SEQ ID NO: 144, SEQ ID NO: 90, and SEQ ID NO: 130. 

20 15. The isolated nucleic acid of claim 1 4, wherein said H. pylori outer 

membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
selected from the group consisting of SEQ ID NO: 74, SEQ ID NO: 1 15, SEQ ID NO: 
87, SEQ ID NO: 1 16, and SEQ ID NO: 84 and SEQ ID NO: 144. 

25 

16. The isolated nucleic acid of claim 14, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof selected from the group consisting 
of SEQ ID NO: 89, SEQ ID NO: 118, SEQ ID NO: 108, SEQ ID NO: 1 10, SEQ ID 

30 NO: 80, SEQ ID NO: 1 12, SEQ ID NO: 128, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID 
NO: 101, SEQ ID NO: 103, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, and 
SEQ ID NO: 131. 

17. An isolated nucleic acid comprising a nucleotide sequence encoding an 
35 H. pylori secreted polypeptide or a fragment thereof, said nucleic acid selected from the 

group consisting of SEQ ID NO: 72, SEQ ID NO: 32, SEQ ID NO: 51, SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 22, SEQ ID NO: 29, SEQ 
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ID NO: 31, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, SEQ 
ID NO: 40, SEQ ID NO: 41, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ 
ID NO: 53, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ 
ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, and SEQ ID NO: 68, or a complement 
5 thereof. 



1 8. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori secreted polypeptide or a fragment thereof selected from the group consisting 
of SEQ ID NO: 145, SEQ ID NO: 105, SEQ ID NO: 124, SEQ ID NO: 75, SEQ ID NO: 

JO 77. SEO ID NO: 82 . SEO ID NO: 8.6._SEO_ID^NO:_95,.SEO_ID_NQ:J.Q2„SEQ-ID-NO: . . 
104, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 1 1 1, SEQ ID 
NO: 1 13, SEQ ID NO: 1 14, SEQ ID NO: 117, SEQ ID NO: 1 19, SEQ ID NO: 122, SEQ 
ID NO: 126, SEQ ID NO: 132, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136,' 
SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, and SEQ ID NO: 141. 

15 

19. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cellular polypeptide or a fragment thereof, said nucleic acid selected from the 
group consisting of SEQ ID NO: 12, SEQ ID NO: 15, SEQ ID NO: 20, SEQ ID NO: 23, 
SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 47, SEQ ID NO: 50, 

20 SEQ ID NO: 60, SEQ ID NO: 64, SEQ ID NO: 69, SEQ ID NO: 70, and SEQ ID NO: 
73, or a complement thereof. 



20. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cellular polypeptide or a fragment thereof selected from the group consisting 

25 of SEQ ID NO: 85, SEQ ID NO: 88, SEQ ID NO: 93, SEQ ID NO: 96, SEQ ID NO: 97, 
SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 120, SEQ ID NO: 123, SEQ ID NO: 
133, SEQ ID NO: 137, SEQ ID NO: 142, SEQ ID NO: 143, and SEQ ID NO: 146. 

21. A probe comprising a nucleotide sequence consisting of at least 8 

30 nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1-SEQ ID NO: 73, or a complement thereof 



35 



22. A recombinant expression vector comprising the nucleic acid of any of 
claims 1, 2, 3, 4, 5, 6, 7, 12, 17, 18, 19 or 20 operably linked to a transcription regulatory 
element. 
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23. A cell comprising a recombinant expression vector of claim 22, 

24. A method for producing an H. pylori polypeptide comprising culturing a 
cell of claim 23 under conditions that permit expression of the polypeptide. 

5 

25. The method of claim 24, further comprising purifying the polypeptide 
from the cell. 

26. A method for detecting the presence of a Helicobacter nucleic acid in a 

1 0 sample comprising: 

(a) contacting a sample with a nucleic acid of any of claims 6 or 21 
so that a hybrid can form between the probe and a Helicobacter nucleic acid in the 
sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
1 5 hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 

27. An isolated H. pylori polypeptide comprising an amino acid sequence at 
least about 60% homologous to an H pylori polypeptide selected from the group 
consisting of SEQ ID NO: 74-SEQ ID NO: 146. 

20 

28. An isolated K pylori polypeptide which is encoded by a nucleic acid 
comprising a nucleotide sequence at least about 60% homologous to a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 73. 

25 29. The isolated H pylori polypeptide of claim 28, wherein said polypeptide 

is encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1-SEQ ID NO: 73. 

30. An isolated H, pylori polypeptide which is encoded by a nucleic acid 

30 which hybridizes under stringent hybridization conditions to a nucleic acid selected from 
the group consisting of SEQ ID NO: 1-SEQ ID NO: 73, or a complement thereof. 

31. An isolated H. pylori polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 74-SEQ ID NO: 146. 



35 



32. An isolated H pylori cell envelope polypeptide or a fragment thereof, 
wherein said polypeptide is selected from the group consisting of SEQ ID NO: 76, SEQ 
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ID NO: 98, SEQ ID NO: 121, SEQ ID NO: 89, SEQ ID NO: 83, SEQIDNO: 118, SEQ 
ID NO: 108, SEQ ID NO: 1 10, SEQ ID NO: 80, SEQ ID NO: 112, SEQ ID NO: 128, 
SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 
125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 131, SEQ ID NO: 74, SEQ ID 
5 NO: 1 15, SEQ ID NO: 87, SEQ ID NO: 116, SEQ ID NO: 84, SEQ ID NO: 144, SEQ 
ID NO: 90, SEQ ID NO: 130, SEQ ID NO: 78, SEQ ID NO: 79, SEQ ID NO: 81, and 
SEQ ID NO: 94. 



33. The isolated polypeptide of claim 32, wherein said H. pylori cell 
10 envelgEe_i)giyge|jtide_Qr_a.fragment thereof-^ 

or a fragment thereof selected from the group consisting of SEQ ID NO: 76, SEQ ID 
NO: 98, and SEQ ID NO: 121. 



34. The isolated polypeptide of claim 32, wherein said H. pylori cell 
15 envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 89, SEQ ID 
NO: 83, SEQ ID NO: 118, SEQ ID NO: 108, SEQ ID NO: 1 10, SEQ ID NO: 80, SEQ 
ID NO: 1 1 2, SEQ ID NO: 1 28, SEQ ID NO: 9 1 , SEQ ID NO: 92, SEQ ID NO: 101, 
SEQ ID NO: 103, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, SEQ ID NO: 
20 131, SEQ ID NO: 74, SEQ ID NO: 115, SEQ ID NO: 87, SEQ ID NO: 1 1 6, SEQ ID 
NO: 84, SEQ ID NO: 144, SEQ ID NO: 90, and SEQ ID NO: 130. 



35. The isolated polypeptide of claim 34, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 

25 terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
selected from the group consisting of SEQ ID NO: 74, SEQ ID NO: 1 15, SEQ ID NO: 
87, SEQ ID NO: 1 16, and SEQ ID NO: 84 and SEQ ID NO:144. 

36. The isolated polypeptide of claim 34, wherein said H. pylori outer 
30 membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 

terminal phenylalanine residue or a fragment thereof selected from the group consisting 
of SEQ ID NO: 89, SEQ ID NO: 1 18, SEQ ID NO: 108, SEQ ID NO: 1 10, SEQ ID NO: 
80, SEQ ID NO: 1 12, SEQ ID NO: 128, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 
101, SEQ ID NO: 103, SEQ ID NO: 125, SEQ ID NO: 127, SEQ ID NO: 129, and SEQ 
35 ID NO: 131. 
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10 



15 



37. An isolated H. pylori cell envelope polypeptide or a fragment thereof, 
wherein said polypeptide is encoded by a nucleic acid selected from the group consisting 
of SEQ ID NO: 3, SEQ ID NO: 25, SEQ ID NO: 48, SEQ ID NO: 16, SEQ ID NO: 10, 
SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 7, SEQ ID NO: 39, 
SEQ ID NO: 55, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, SEQ ID NO: 58, SEQ ID NO: 1, 
SEQ ID NO: 42, SEQ ID NO: 14, SEQ ID NO: 43, SEQ ID NO: 11, SEQ ID NO: 71, 
SEQ ID NO: 17, SEQ ID NO: 57, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, and 
SEQ ID NO: 21. 

38. The isolated polypeptide of claim 37, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide 
or a fragment thereof encoded by a nucleic acid selected from the group consisting of 
SEQ ID NO: 3, SEQ ID NO: 25, and SEQ ID NO: 48. 



39. The isolated polypeptide of claim 37, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof encoded by a nucleic acid selected from the group consisting of 
SEQ ID NO: 16, SEQ ID NO: 10, SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID NO: 37, 
20 SEQ ID NO: 7, SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 1 8, SEQ ID NO: 19, 
SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, 
SEQ ID NO: 58, SEQ ID NO: 1, SEQ ID NO: 42, SEQ ID NO: 14, SEQ ID NO: 43, 
SEQ ID NO: 1 1, and SEQ ID NO: 71 . 

25 40. The isolated polypeptide of claim 39, wherein said H. pylori outer 

membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
encoded by a nucleic acid selected from the group consisting of SEQ ID NO: 1, SEQ ID 
NO: 42, SEQ ID NO: 14, SEQ ID NO: 43; SEQ ID NO: 1 1 and SEQ ID NO:71. 

30 

4 1 . The isolated polypeptide of claim 3 9, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof encoded by a nucleic acid selected 
from the group consisting of SEQ ID NO: 16, SEQ ID NO: 45, SEQ ID NO: 35, SEQ ID 
35 NO: 37, SEQ ID NO: 7, SEQ ID NO: 39, SEQ ID NO: 55, SEQ ID NO: 1 8, SEQ ID 
NO: 19! SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID 
NO: 56, SEQ ID NO: 58. 
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42. An isolated H. pylori cellular polypeptide or a fragment thereof, wherein 
said polypeptide is selected from the group consisting of SEQ ID NO: 85, SEQ ID NO: 
88, SEQ ID NO: 93, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 

5 100, SEQ ID NO: 120, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID NO: 137, SEQ ID 
NO: 142, SEQ ID NO: 143, and SEQ ID NO: 146. 

43. An isolated H. pylori cellular polypeptide or a fragment thereof, wherein 
said polypeptide is encoded by a nucleic acid selected from the group consisting of SEQ 

1 0 ID NO: 12, SEQ ID NO: 15, SEQ ID NO: 20, SEQ ID NO: 23 . SEQ ID NO: 24 , SEQ . 
ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 47, SEQ ID NO: 50, SEQ ID NO: 60, SEQ 
ID NO: 64, SEQ ID NO: 69, SEQ ID NO: 70, and SEQ ID NO: 73. 



44. An isolated H. pylori secreted polypeptide or a fragment thereof, wherein 
1 5 said polypeptide is selected from the group consisting of SEQ ID NO: 145, SEQ ID NO: 
105, SEQ ID NO: 124, SEQ ID NO: 75, SEQ ID NO: 77, SEQ ID NO: 82, SEQ ID NO: 
86, SEQ ID NO: 95, SEQ ID NO: 102, SEQ ID NO: 104, SEQ ID NO: 106, SEQ ID 
NO: 107, SEQ ID NO: 109, SEQ ID NO: 111, SEQ ID NO: 1 13, SEQ ID NO: 1 14, SEQ 
ID NO: 1 17, SEQ ID NO: 1 19, SEQ ID NO: 122, SEQ ID NO: 126, SEQ ID NO: 132, 
20 SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 138, SEQ IDNO: 
139, SEQ IDNO: 140, and SEQ ID NO: 141. 



45. An isolated H. pylori secreted polypeptide or a fragment thereof, wherein 
said polypeptide is encoded by a nucleic acid selected from the group consisting of SEQ 

25 ID NO: 72, SEQ ID NO: 32, SEQ ID NO: 51, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 9. SEQ ID NO: 13, SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID 
NO: 33, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 40, SEQ ID 
NO: 41, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 53, SEQ ID 
NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 65, SEQ ID 

30 NO: 66, SEQ ID NO: 67, and SEQ ID NO: 68. 



46. A fusion protein comprising an H. pylori polypeptide which comprises an 
amino acid sequence selected from the group consisting of SEQ ID NO: 74-SEQ ID NO: 
146 operatively linked to a non-H. pylori polypeptide. 



35 
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47. A vaccine formulation for prophylaxis or treatment of an H. pylori 
infection comprising an effective amount of at least one isolated nucleic acid of any of 
claims 1, 2, 3, 4, 5, 6, 7, 12, 17, 18, 19, or 20. 

5 48. A vaccine formulation for prophylaxis or treatment of an H. pylori 

infection comprising an effective amount of at least one H. pylori polypeptide or a 
fragmentthereof of any of claims 26, 27, 28, 29, 30, 31, 32, 37, 42, 43, 44 or 45. 

49. A vaccine formulation of claim 47, further comprising a pharmaceutically 
10 acceptable carrier. 

50. A vaccine formulation of claim 48, further comprising a pharmaceutically 
acceptable carrier. 

51. A vaccine formulation of claim 49, wherein the pharmaceutically 
acceptable carrier comprises an adjuvant. 

52. A vaccine formulation of claim 50, wherein the pharmaceutically 
acceptable carrier comprises an adjuvant. 

53 . A vaccine formulation of claim 49, wherein the pharmaceutically 
acceptable carrier comprises a delivery system. 

54. A vaccine formulation of claim 50, wherein the pharmaceutically 
acceptable carrier comprises a delivery system. 

55 . A vaccine formulation of claim 5 3 , wherein the delivery system 
comprises a live vector. 

A vaccine formulation of claim 54, wherein the delivery system 
live vector. 

A vaccine formulation of claim 55, wherein the live vector is a bacteria o 
A vaccine formulation of claim 56, wherein the live vector is a bacteria o 

a virus. 



15 



20 



25 



30 56. 
comprises a 

57. 

a virus. 

35 

58. 
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59. A vaccine formulation of claim 53, wherein the pharmaceutically 
acceptable carrier further comprises an adjuvant. 

5 60. A vaccine formulation of claim 54, wherein the pharmaceutically 

acceptable carrier further comprises an adjuvant. 

61. A method of treating or reducing a risk of K pylori infection in a subject 
comprising administering to a subject a vaccine formulation of claim 47, such that 

10 treatment or re duction of risk of K p ylori infection occurs . _ „ . 

62. A method of treating or reducing a risk of H. pylori infection in a subject 
comprising administering to a subject a vaccine formulation of claim 48, such that 
treatment or reduction of risk of H. pylori infection occurs. 

15 

63. A method of producing a vaccine formulation comprising: combining at 
least one isolated H. pylori polypeptide or a fragment thereof selected from the group 
consisting of SEQ ID NO: 74-SEQ ID NO: 146 with a pharmaceutically acceptable 
carrier to thereby form a vaccine formulation. 

20 

64. A method of producing a vaccine formulation comprising: 

(a) providing at least one isolated K pylori polypeptide or a fragment 
thereof selected from the group consisting of SEQ ID NO: 74-SEQ ID NO: 146; and 

(b) combining at least one said isolated K pylori polypeptide or a 
25 fragment thereof with a pharmaceutically acceptable carrier to thereby form a vaccine 

formulation. 

65. A method of producing a vaccine formulation comprising: 

(a) culturing a cell under condition that permit expression of an //. 
30 pylori polypeptide or a fragment thereof selected from the group consisting of SEQ ID 

NO: 74-SEQ ID NO: 146; 

(b) isolating said H, pylori polypetide from said cell; and 

(c) combining at least one said isolated H. pylori polypeptide or a 
fragment thereof with a pharmaceutically acceptable carrier to thereby form a vaccine 

35 formulation. 
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MIKRIAC-ILSLSASLAIiAGEVl^ GFFMGAGYQQG ^YGPYNSNY 

MIKRIAC-ILSLSASLALAGEVT GFFMGAGYQQG lYGPYNSNY 

MKKFFSQSLLAL-IISMNAVSGMDG--^ GVFLGAGYLQG 3AQMHADIN 

MKKFFSQSLLAL-IISMNAVSGMDG — I GVFLGAGYLQG JAQMHADIN 

MARULMKKFVALGLLSAVLSSSLLAEGI GVYIGTNYQLG 3ARLNSNIYNTGDCTGS 



BLOCK A 



BLOCK B 



-SDWRHGN-DU 
-SDWRHGN- 



WGCPPGLTANKHNPGGTNINWHSKYANGALirGFGLNVGYKKF F^QFKSLDMTf KWFGFRV 



GLNFKLGFVGF\N 
DLtGLNFKLGFVGFAN 
SQKQATNATII IGFDALLGYQFF 
SQKQATNATII IGFDALLGYQFF 



BLOCK C 



KWFGARV 
KWFGARV 
KHFGLRL 
KHFGLRL 



YGFLE WFNTSGTEHT KTNLLTYGGGGD 

YGFLt WFNTSGTEHT KTNLLTYGGGGD 

YGFFE YAHANSIKLKNPNYNSEAAQVASQILGKQEINRLTNIADPRTFEPNMLTYGGAMD 
YGFFEYAHANSIKLKNPNYNSEAAQVASQILGKQEINRLTNIADPRTFEPNMLTYGGAMD 
YGLFE YGHADLGKQVY APNKIQLDMVSWGVGSD 



BLOCK D 



LIVNLIPLDKFAI GLIGGVQLAGNTV\ MFPYDVNQ 
LIVNLI PLDKFAL GLIGGVQLAGNTV MFP YDVNQ 
VMVNVINNGIMSL GAFGGIQLAGNSW LMATPSFEGILVEQAL 
VMVNVINNGIMSL GAFGGIQLAGNSVy LMATPSFEGILVEQAL 



-V 
-V 



LLADIIDKDNASF GIFGGVAIGGNTV\ KSSAANYWKEQIIEAKGPDVCTPTYCNPNAPYST 



BLOCK E 



BLOCK F 



TF FQFLWNLGGRlilRVGDRSAFEAGVKFPMV ^QG SKDVGLI RYYSWYV 

TF FQFLWNLGGRMRVGDRSAFEAGVKFPMV ^QG SKDVGLI RYYSWYV 

SKKATS FQFLFNVGARLRILKHSSXEAGVKFPML ECKNPYIT AKNLDIGFF RVYSWYV 

SKKATS FQFLFNVGARLRILKHSSIEAGVKFPMLECKNPYIT AKNLDIGFF RVYSWYV 

NTSTV;^ FQVWLNFGVRANIYKHNGVEFGVRVPLL INKFLSAGPNATNLYYHLE RDYSLYL 



DYVFTF 
DYVFTF 
NYVFTF 
NYVFTF 
GYNYTF 
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MRKLFIPLtJLFSAIiEANEK^K3FFIEalGFETGLLEGTQTQEKRHTTTKOTrY^ 



MRKLFIPLLLFSAIiEANEKNGFFIEAGFETGLLEGTQTQEKimTTTKNTyATYN^ 
MRKJuFIPLLLFSALEANEKNGFFIEAGFETGLLEGTQTQEKRHTTTKNTyATYl^ 



ILKIUANLgTNA£A XSKLKFSSIiSRVHVt.YimffGOLTXENFLPYl^ 



II^KRAANLFTNAEAISKLKFSSLSPVRVIjYHYNGQLTIENFLPVNLNW 



IDIiGVIETIPKHSKXVIiPGEAFDSL KIDPYTLFLPKIEATSTSISDANTQRVFET 

VIETIPKHSKIVLPGETlFDSr-KEAFDKIDPYTFFFPKFEATSTSISDTirrQRVFET 

IDLGVIETIPKHSKIVLPGEAFDSLKEAFDKIDPYTLFLPKFEATSTSISDTNTQRW 
XDXiGVXETIPKHSKIVIiPGEAFDSli KIDPYTIiFLPiaEATSTSISDANTQRVrET 



LNKIKTNLWNYRNEN KFKDHENHWEAFTPQTAEEFTNI^LNMIAVI-DS 

LNNIKTNLIMKYSNENPNNFNTCPYNNNGOTKNIXn^QN 
liNOTiKTNLIMKZSNENPNNFOTCPYNNNGNTKNDCWQNFTPOT^ 

LNKIKTNLWNYRNEN KFKDHENKWEAFTPQTAEEFTNLMLNMIAVLDS 

*■ w** * ***** A ****** 

QSWGDAILNAPFEFTOSPTIXrDNTJPSKCVNPGTNGLVNSKTOQKYVIi^ 
QSWGt)AILNAPFEFTNSSTIX:DSCPSKCVNPGVNGRVDTKVDQQVTI*NKQG 
QSWGDAII^APFEFTNSSTDCDSDPSKCVKPGVNGRViyrKVDQQYIL*NKQGIIN^ 
QSWGDAIIiNAPFEFTOSPTDCDKDPSKCn;KPGTNGLVNSKVIX3KYVr2aKQDI\^ 

***************** **** *»«v««««^ ^« « ^ iTTt * * , *r , «.* *. * 

DLDVIVIiXDSGWGIiGSDITPS^^JDIX3KHYGQLGWASALDPKKLFGDN 

EIDAWIiKNSGWGLANGYG NDG-EyGTLGVEAYALDPKKIiFGlTOLKTlNLEDLRT 

EIDAWLKNSGWGLANGVG NDG-EYGTLGVEAYAIiDPKKLFGNDLKTINLEDIjRT 

DLDVIVLKDSGWGLGSDITPSNNDIXSKHYGQU^WASALDPKKLFGDN^ 



ILHEFSHTKGYGHNGNMTYQRVPVTKDGQVEKDSNGKPKDSDGLPYNVC 

ILHEFSHTKGYGHNGl^YQRVFVTKTCQVEKDSNGKPKDSDGLPYWCSLYGGSN^ 

ILHEFSHTKGYGHWGNMTYQRVPVTKDGQVEKDSNGKPKDSDGDPYNVCSIiYGGSNQPAF 

ILHEFSHTKGYGHNGriKTYQRVTVTKDGQVEKJ^SNGKPKDSIXSriPYWCSLYGGSNQ 



PSNYPNSIYHNCADVPAGFLGVTAAVWQQLIIJQNALPINYANI^SQTNYNLNASL^ 

PSOTPNSXYHNCADVPAGFiy:?VTAAVWQQLlNQNALPINYAMUSSQTNTO 

PSNYPNSIYHNCADVPAGFI/r^TTAAWQQLlNQNALPINYANLGSQTimJLNAS 
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VLKTQKLPLLFVSILyNQSPLrAFDYKFSGVAESVSKvjGFNHSKLNSKEGIFPTATF^ 

VSYDN TDDYYFP R^ GVIFSSYATMSGLPSSGTLNS /J 

. * . ** * 
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__ . . 
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112 
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112 
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TQGG YIFR YNTDDYLPLNSTFYMGGVTTVRGFRNG 

* ★ 



* ★ 



DYKINSKIALKWFSSFGRALAFGQWIRDWYAPIVTEDGRKEV^DGIHAAQLYFSfeKHVQV 



-SITPKDEFGLWLG- 
* * * 



* 
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DGIFTASTELS — 
*★* ★ * 



MPFAYFSPKIYGAPGVKIHIDSNPKFKGLGLRAQTTINVIFPVYAKDLYDVYWRNSKIGE 

VLKAAKMRLAWFFDFGFLTFKTPTRGSFFYN 

** * . * * . . * 

BLOCK D 



WGASLLIHQRI DYNEFNFGF GYYQNFGNANARIGWYGNPIPFNYRNNSVYGGVFSNAITA 
- - APTTTANFJ DYGWGAGF ERATWRASTGLQIEWXSPMGPLVL 



DAVSGYVFGGGVYRGFLWGILGRYTYATRASERSINLNLGYKWGSFARVDVNLEYYWSM 



-IFPIAFFN- 



-QWG- 



80 
112 



HNGYRLDYLTGPFNKAFKADAQDRE NLMVSMKFFF 



GNGKKCKGLC — FN PNMNDYTQ 



BLOCK E 



HFEFSMGTRF 
* * * 
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